PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




(51) International Patent Classification 6 : 




(11) International Publication Number: 


WO 96/23073 


C12N 15/75, 15/65 


Al 


(43) International Publication Date: 


1 August 1996 (01.08.96) 



(21) International Application Number: PCT/DK96/00038 

(22) International Filing Date: 23 January 1996 (23.01.96) 



(30) Priority Data: 

0083/95 
0799/95 



23 January 1995 (23.01.95) DK 
6 July 1995 (06.07.95) DK 



(71) Applicant (for all designated States except US): NOVO 

NORDISK A/S [DK/DKJ; Novo A116, DK-2880 Bagsvaerd 
(DK). 

(72) Inventor; and 

(75) Inventor/Applicant (for US only): J0RGENSEN, Steen, 
Trocls [DK/DKj; Novo Norilsk a/s, Novo Alle\ DK-2880 
Bagsvaerd (DK). 

(74) Common Representative: NOVO NORDISK A/S; Corporate 
Patents, Novo Alle\ DK-2880 Bagsvaerd (DK). 



(81) Designated States: AL. AM. AT. AU, BB, BG, BR. BY, CA. 
CH. CN, CZ, DE, DK, EE, ES, FI, GB, GE, HU. IS. JP, KE. 
KG, KP, KR. KZ, LK. LR, LS. LT, LU, LV, MD, MG, MK. 
MN. MW. MX. NO. NZ. PL. PT, RO. RU, SD, SE, SG. 
SI, SK, TJ, TM, TR, TT, UA. UG. US, UZ, VN, ARDPO 
patent (KE, LS, MW, SD, SZ, UG), European patent (AT, 
BE. CH. DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM. GA. GN, 
ML, MR, NE, SN. TD, TG). 



Published 

With international search report. 



(54) TiUe: DNA INTEGRATION BY TRANSPOSITION 
(57) Abstract 

Multicopy strains of gram-positive bacteria carrying multiple copies of a DNA sequence of interest may be constructed by use of 
a method involving introduction of a DNA construct comprising the DNA sequence of interest into the genome of the recipient cell by 
transposition and subsequent deletion of a marker gene used for selection of cells having received the DNA construct by a resolution system. 
The multicopy strains are preferably free from a gene encoding an undesirable marker such as an antibiotic resistance marker. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AM 


Armenia 


AT 


Austria 


AU 


Australia 


BB 


Barbados 


BE 


Belgium 


BF 


Burkina Faso 


BG 


Bulgaria 


BJ 


Benin 


BR 


Brazil 


BY 


Belarus 


CA 


Canada 


CF 


Central African Republic 


CG 


Congo 


CH 


Switzerland 


CI 


Cote d'lvoire 


CM 


Cameroon 


CN 


China 


CS 


Czechoslovakia 


CZ 


Czech Republic 


DE 


Germany 


DK 


Denmark 


EE 


Estonia 


ES 


Spain 


FI 


Finland 


FR 


France 


GA 


Gabon 



GB 


United Kingdom 


GE 


Georgia 


GN 


Guinea 


GR 


Greece 


HU 


Hungary 


IE 


Ireland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 


KG 


Kyrgystan 


KP 


Democratic People's Republic 




of Korea 


KR 


Republic of Korea 


KZ 


Kazakhstan 


LI 


Liechtenstein 


LK 


Sri Lanka 


LR 


Liberia 


LT 


Lithuania 


LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco 


MD 


Republic of Moldova 


MG 


Madagascar 


ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 



MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


SZ 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


uz 


Uzbekistan 


VN 


Vict Nam 



WO 96/23073 



PCT/DK96/00038 



DNA INTEGRATION BY TRANSPOSITION 
FIELD OF THE INVENTION 

The present invention relates to a novel DNA construct useful 
for the construction of a bacterial cell having integrated more 
5 than one copy of a DNA sequence of interest into its genome, 
.which cell may be free of any selection markers, and a method 
of constructing such cells. 

BACKGROUND OF THE INVENTION 

Prokaryotic transposable elements are discrete DNA sequences 
10 capable of insertion at single or multiple sites within a 
prokaryotic genome. Normally, such elements consist of a gene 
encoding a transposase protein and a transposable cassette 
comprising a resistance gene flanked by sequences recognized by 
the transposase protein. Transposition of the transposable 
15 cassette into the genome of a host cell (which may, e.g. take 
place at random or at hot spot sites) occurs via recognition 
and interaction with the flanking sequences of the transposable 
cassette by the transposase protein. 

Different classes of transposable elements exist. One class 
20 comprises i) insertion sequences (IS) which are small (less 
than 2 kb) DNA fragments encoding transposase proteins or other 
determinants mediating transposition, and ii) composite 
transposons, i.e. DNA fragments flanked by two copies of an 
insertion sequence. The terminal portions of all is sequences 
25 comprises inverted repeat sequences. The transposase protein 
functions by recognizing these terminal sequences and interac- 
ting with the sequences to effect transpositions within the 
genome . 

The second class of transposons is the Tn3 family of tranpo- 
30 sons. These transposons encode two products involved in a two- 
step transposition process: a transposase and a resolvase. 
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Transposons belonging to this second class have inverted 
terminal repeats of approximately 35-40 bp. 

The third class includes bacteriophage Mu and related phages. 
Bacteriophage Mu is large relative to other transposons with a 

s genome of 36 kb. Mu encodes two gene products which are 
involved in the transposition process, a 70 kDa transposase and 
an accessory protein of approximately 33 kDa. An unusual 
feature of Mu that distinguishes it from other tranposons is 
that its ends are not inverted repeat sequences. The Mu 

io transposase has, however, been shown to bind to both ends in an 
in vitro binding assay. 

Transposons have been used extensively for mutagenesis and 
cloning in gram-positive and gram-negative bacteria: Youngman, 
P. J., Perkins, J. B. , Losick, R. (1983) Genetic transposition 
15 and insertional mutagenesis in Bacillus subtilis with Strepto- 
SaSSttS faecal is transposon Tn917, Proc. Natl. Acad. Sci. USA, 
80, 2305-2309? Youngman, P., Perkins, J.B., Losick, R. (1984), 
Construction of a cloning site near one end of Tn917 into which 
foreign DNA may be inserted without affecting transposition in 
20 Bacillus subtilis or expression of the transposon-borne erm 
gene, Plasmid 12, 1-9; Youngman, P. (1985) Plasmid vectors for 
recovering and expoliting Tn917 transpositions in Bacillus 
subtilis and other gram positives, p. 79-103 in K. Hardy (ed. ), 
Plasmids: a practical approach, IRL Press, Oxford; Kleckner, 
25 N., Roth, J., Botstein, D. (1977) Genetic engineering in vivo 
using translocatable drug-resistance elements. New methods in 
bacterial genetics, J. Mol. Biol., 116, 125-159; Wati, M. R. , 
Priest, F. G., Mitchell, W. J. (1990) Mutagenesis using Tn917 
in Bacillus 1 ichenif ormis . FEMS microbiol. Lett., 71, 211-214; 
so Petit, M.-A., Bruand, C, Janniere, L. , Ehrlich, S. D. (1990) 
TnlO-derived transposons active in Bacillus subtilis. J. 
Bacteriol., 172, 6736-6740. 



The latter reference describes pHV1248 and pHV1249, plasmids 
that are thermosensitive for replication, which carry a 
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transposase gene from TnlO modified to be expressed in B. 
subtilis , and sufficient sequences from the IS10 elements of 
TnlO flanking a chloramphenicol resistance gene (mini-TnlO) to 
allow transposition of mini-TnlO into the B. subtilis chromoso- 
5 me* 

Maguin et al. (Maguin, E. , Duwat, P., Hege, T. , Ehrlich, D, 
Gruss, A. (1992) New thermosensitive plasmid for gram-positive 
bacteria, J. Bacteriol. 174, 5633-5638) describe an alternative 
version of this system. 

10 EP 485 701 discloses the use of transposons for introduction of 
single copies of a DNA sequence into a prokaryotic cell genome, 
the transposase protein being encoded in cis. 

Slugenova et al*, ((1993), Enhanced a-amylase production by 
chromosomal integrtation of pTVAl in industrial strain in B^ 
15 subtilis . Biotechnology Letters, 15, 483-488) describe the 
multiple integration of plasmid pTVAl comprising a modified 
transposon Tn917 and an cr-amylase gene of interest located 
outside the transposon. Antibiotic resistance marker genes are 
present in the resulting strain. 

20 EP 0 332 488 describes a transposition based system for 
construction of multicopy bacterial strains, i.e. strains 
comprising multiple copies of a gene of interest, which strains 
further comprise multiple copies of a selectable marker gene 
introduced with the gene of interest. The system is exemplified 

25 by use of a phage Mu transposon for modification of gram- 
negative bacteria. 

WO 95/01095 describes the use of a minitransposon as a vector 
for stably tranforming an exogenous gene into a eukaryotic 
(e.g. animal) chromosome. 

30 Simon, R. , Priefer, U. , Pxihler, A. ((1983), A broad host range 
mobilization system for in vivo genetic enginering: Transposon 
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mutagenesis in gram-negative bacteria, Bio/Technology, 1, 784- 
791) describe the use of E. coli specific vectors to transfer 
transposons into other gram-negative strains by conjugation. 

Several of the above described multicopy strains have been 
5 produced by integration of a genetic construct comprising the 
gene of interest and an antibiotic selectable marker and 
amplifying said construct by culturing the cell in the presence 
of increasing dosages of antibiotic. Thus, the resulting cell 
typically comprises a number of antibiotic resistance genes, 
10 The presence of such genes are undesirable, in particular from 
an environmental and a product approval point of view. 

Non-antibiotic selection markers have been used for construc- 
tion of multicopy strains. Herrero, M. , de Lorenzo, v. , Timmis, 
K.N. ((1990), Transposon vectors containing non-antibiotic 
15 resistance selection markers for cloning and stable chromosomal 
insertion of foreign genes in gram-negative bacteria, J. 
Bacteriol., 172, 6557-6567) describe such as system in which 
herbicide or heavy metal resistances are used as selection 
markers . 

20 Another alternative to the use of antibiotic resistance markers 
are described in DE 4 231 764 in which an alternating selection 
of Thy" (trimethoprim resistance) and Thy* (thy prototrophy) is 
used for introduction of product genes in Bacillus spp. thereby 
avoiding the need for selectable markers. 

25 Specific deletion of DNA segments from the chromosomes of bac- 
terial species have traditionally been performed by the methods 
of homologous recombination (Hamilton, C. H. , Aldea, M. , Wash- 
burn, B. K., Babitzke, P. (1989). New method of generating 
deletions and gene replacement in Escherichia coli. J. Bac- 

30teriol., 171, 4617-4622.; Maguin, E., Duwat, P., Hege, T. , 
Ehrlich, D, Gruss, A. (1992). New thermosensitive plasmid for 
gram-positive bacteria. J. Bacterid. 174, 5633-5638.). 
However, the use of homologous recombination to delete re- 
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sistance marker genes from strains having multiple, tandem 
copies of such genes each linked to a copy of the gene of in- 
terest is hardly applicable, as homologous recombination would 
delete also the extra copies of the gene of interest. 

5 The concept of using site specific recombination systems for 
integration and retrieval of sequences from the bacterial 
chromosome, using elements from either phage lambda or PI, and 
some specific methods of achieving this, has been described 
(Hasan, N. , Koob, M. , Szybalski, W. (1994), Escherichia coli 

10 genome targeting, I. Cre-lox-mediated in vitro generation of 
ori" plasmids and their in vivo chromosomal integration and 
retrieval. Gene, 150, 51-56). The cre-lox system is further 
described by Abremski, K., Hoess, R. , Sternberg, N. (1983), 
Studies on the properties of PI site-specific recombination: 

15 Evidence for topologically unlinked products following recombi- 
nation, Cell, 32, 1301-1311. An alternative system is based on 
the recombination system of the broad-host range plasmid RP4 
(Eberl, L. , Kristensen, C.S., Givskov, M. , Grohmann, E. , 
Gerlitz, M., Schwab, H. (1994), Analysis of the multimer 

20 resolution system encoded by the parCBA operon of broad-host- 
range plasmid RP4 r Mol. Microbiol., 12, 131-141)). Stark, W.M. , 
Boocock, M.R. , Sherratt, D.J. (1992), Catalysis by site- 
specific recombinases, Trends in Genetics, 8, 432-439) is a 
review article on the mechanism of resolvase action. Camilli et 

25 al. ((1994) , Use of genetic recombination as a reporter of gene 
expression, Proc. Natl. Acad. Sci. USA, 91, 2634-2638) describe 
the use of res sites and resolvase from the 7 S transposon in 
Vibrio cholera as a permanent, heritable marker of gene 
expression from a chromosomal gene. The resolution system is 

30 not used for excision of marker genes. Chang, L.-K. et al. 
((1994, Construction of Tn917asl, a transposon useful for 
mutagenesis and cloning of Bacillus subtilis genes, Gene, 150, 
129-134) describe the plasmids (pE194) containing erm-res-tnpA 
(transposase)-tnpR (resolvase) samt IR-res-ori colEl-AB R l-AB R 2- 

35 IR (pD917; Tn917acl) . The two res sites are there to allow the 
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transposon to function properly, not for excision of inter- 
vening DNA. 

The broad host range, gram-positive plasmid pAM01 (Clewell, D. 
B., Yagi, Y. , Dunny, G. M. , Schultz, S. K. (1974) Charac- 
5 terization of three plasmid deoxyribonucleic acid molecules in 
a strain of Streptococcus faecalis: identification of a plasmid 
determining erythromycin resistance. J. Bacterid. 117, 283- 
289) has been described to contain a resolution system, that 
resolves plasmid multimers into monomers via a site specific 

io recombination event, requiring a specific plasmid encoded en- 
zyme (resolvase) and a site, res, on the plasmid (Swinfield, 
T.-J., Janniere, L. , Ehrlich, S. D., Minton, N. P. (1991). 
Characterization of a region of the Enterococcus faecalis plas- 
mid pAM01 which enhances the segregational stability of pAM01- 

15 derived cloning vectors in Bacillus subtilis. Plasmid 26, 209- 
221; Janniere, L. , Gruss, A., Ehrlich, S. D. (1993) Plasmids, 
pp. 625-644 in Sonenshein, A. L. , Hoch, J. A., Losick, R. 
(eds.) Bacillus subtilis and other gram-positive bacteria: 
Biochemistry, Physiology and molecular genetics. American soci- 

20 ety for microbiology, Washington D.C.)- 

It has been suggested to use a site-specific recombination 
system to remove a single selectable marker gene from the 
genome of a bacterial cell. For instance, Dale, et al. ((1991) 
Gene transfer with subsequent removal of the selection gene 

25 from the host genome, Proc. Natl. Acad. Sci. USA, 88, 10558- 
10562) describe the use of the cre/lox system for removal of 
markers from transgenic plants and mentions that the use of 
this system would obviate the need for different selectable 
markers in subsequent rounds of gene tranfer into the same 

30 host. Kristensen, C.S. et al. (1995), J. Bacterid., 177, 52- 
58, describe the use of the multimer resolution system of the 
plasmid RP4 for the precise excision of chromosomal segments 
(such as marker genes introduced with heterologous DNA) from 
gram-negative bacteria. It is stated that the system is 

35 envisaged to be of interest in the generation of chromosomal 
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insertions of heterologous DNA segments eventually devoid of 
any selection marker. 

WO 95/02058 describes a new transposon (tn5401) from B. 
thurinaiensis containing transposase, resolvase, and res site. 

5 The transposon is used in a plasmid which contains B. thurinai- 
ensis DNA (e.g. origin and toxin gene) and, flanked by res 
sites, non- B. thurinaiensis DNA (e.g. E. coli origin, selec- 
table marker genes) . The plasmid is introduced into B. thurin- 
aiensis . Subsequently, a plasmid expressing the resolvase is 

10 introduced (e.g. a thermosensitive plasmid containing the 
entrire tranposon - but only used as resolvase donor) whereby 
the non- B. thurinaiensis DNA is excised from the first plasmid. 

Conclusions with respect to the state of the art 
On the basis of the above citations, the following conclusions 
15 may be made as to the state of the art: 

The insertion of multiple genes of interest by transposition 
was known, e.g. as described in EP 332 488. However, all 
strains carrying multiple transposed sequences of interest 
contain selectable markers. 

20 The removal of markers via site-specific recombination was know 
from either chromosome or plasmids (cf. Kristensen et al. 
(1995), Eberl et al. (1994), WO 95/02058). It was known to 
remove a marker introduced by transposon. 

Multicopy strains without presence of heterologous, selectable 
25 marker genes were known (DE 4231 764). These strains were 
constructed by a cumbersome method depending on the use of the 
Thy marker. 

It is an object of the present invention to construct bacterial 
cells which harbour a stable, fixed and well-defined copy 
30 number of one or more genes of interest, without the presence 
of selectable marker genes in the final strain. 
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BRIEF DESCRIPTION OF THE INVENTION 

The present invention is based on the surprising finding that 
transposition of a DNA construct which, in addition to trans- 
posase target sequences and the transposase gene necessary to 
5 achieve transposition, contains target sequences for a site- 
specific recombination enzyme, may be used for construction of 
multicopy gram-positive bacterial strains. 

In a first aspect the invention relates to a DNA construct 
useful for integration into the genome of a host cell, the 
io construct comprising the structure IR(1) -P-R-M2-R-IR(2) or 
IR(1)-R-M2-R-P-IR(2) , wherein 

IR(1) and IR(2) denote transposase target sequences, 
P a DNA fragment comprising a DNA sequence of inter- 
est, 

is R a target sequence for a site-specific recombination 

enzyme, and 

M2 a selectable marker gene, 

the structure being associated with a gene (T) encoding a 
transposase protein capable of recognizing and interacting with 
20 the transposase target sequences IR(1) and IR(2) and located 
outside the structure defined by the IR(1) and IR(2) sequences. 

In the present context the term "associated with" is intended 
to indicate that T is present on the DNA construct of the in- 
vention, but need not be in direct contact with the structure 

25 defined by IR(1) and IR(2), i.e. located within and comprising 
IR(1) and IR(2). Thus, linker or other sequences may be present 
in between the structure and T. The exact location of T is not 
critical as long as the gene is located outside the structure 
defined by IR(1) and IR(2) . Thus, T may be located on either 

30 side of the structure defined by IR(1) and IR(2). 

In the present context the term "selectable marker gene" is 
intended to indicate a DNA sequence which encodes a gene 
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product which provides to the host cell expressing the product 
a selectable characteristic, such as resistance towards an 
antibiotic. The DNA sequence M2 may further comprise one or 
more regulatory elements required for or involved in the ex- 
5 pression of the DNA sequence encoding the selectable marker, 
such as a promoter, a terminator and the like. The regulatory 
elements may be either heterologous or homologous to the DNA 
sequence . 

The term "transposase gene" is intended to indicate a DNA 
10 sequence encoding a transposase protein, i.e. a protein which 
is essential for transposition to take place. The gene may 
further comprise one or more regulatory elements required for 
or involved in the expression of the DNA sequence encoding the 
transposase protein, such as a promoter, a terminator and the 
15 like. The regulatory element (s) may be either heterologous or 
homologous to the DNA sequence. 

The term "transposase target sequence" is intended to indicate 
DNA sequences recognized by the transposase protein encoded by 
the transposase gene T. The transposase target sequences IR(1) 

20 and IR(2) shall contain sufficient DNA derived from transposon 
terminal sequences to retain the function of these sequences in 
allowing transposition of the structure comprising and located 
within these sequences upon expression of the transposase 
encoded by T. The minimal sequences that are sufficient for 

25 transposition vary in composition and length depending on which 
transposon the system is developed from. For instance, for TnlO 
derived transposase target sequences, either 2 3 or 42 basepairs 
are enough to allow transposition (Kleckner, N. (1988) Trans- 
poson TnlO. pp 227-268 in Berg, D. E. , and Howe, M. M. (eds) . 

30 Mobile DNA. American society for microbiology, Washington, 
D.C. ) . 

In the present context the "target sequence for a site-specific 
recombination enzyme" is intended to indicate a DNA sequence 
which is recognized by a site-specific recombinase enzyme 
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(discussed in detail further below) . The target sequences may 
be, but need not be identical. Thus, variations between the 
sequences may occur as long as the recombinase is capable of 
recognizing and interacting with the sequences. 

5 The term "DNA sequence of interest" is used to indicate a 
sequence which codes for a desired RNA or protein product 
(heterologous or native to the host cell) or which in itself 
provides the host cell with a desired property, e.g. a mutant 
phenotype. When appropriate, the DNA sequence may also comprise 

io one or more regulatory elements required for or involved in the 
expression of the DNA sequence encoding the desired RNA or 
protein product, such as a promoter, a terminator and the like. 
The regulatory element (s) may be either heterologous or homolo- 
gous to the DNA sequence. 

15 The novel DNA construct of the invention has been found of 
particular use in construction of strains which in their genome 
comprise more than one, and preferably multiple, randomly 
located copies of integrated DNA sequences, which cells in 
preferred aspects are free from genes encoding a selectable 

20 marker the presence of which is undesirable, e.g., from an 
environmental point of view. It will be noted that the DNA con- 
struct of the invention is different from those provided in EP 
485 701 in that the selectable marker gene in the present 
invention is located outside the structure flanked by trans- 

25 posase target sequences. This location is essential for 
constructing marker-free cells in accordance with the present 
invention. 

An alternative approach to using the above described recombi- 
nase system for deleting a marker gene used in the construction 
30 of modified cells by transposition is to effect the deletion of 
the marker gene by homologous recombination. Accordingly, in a 
second aspect of the present invention relates to a DNA con- 
struct is provided which comprises the structure IR(1) -P-r'-M2- 
R"-IR(2) or IR(1)-R'-M2-R"-P-IR(2) , wherein 
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IR(1) and IR(2) denote transposase target sequences, 
P a DNA fragment comprising a DNA sequence of inter- 
est, 

R 1 and R", respectively, a DNA sequence which is pro- 
5 vided in a parallel repeat on either side of the 

selectable marker gene M2, and 
M2 a selectable marker gene, 

the structure being associated with a gene (T) encoding a 
transposase protein capable of recognizing and interacting with 
io the transposase target sequences IR(1) and IR(2) and located 
outside the structure defined by the IR(1) and IR(2) sequences, 
i.e. outside the structure IR(1) -P-R'-M2-R"-IR(2) or IR(1)-R*- 
M2-R"-P-IR(2) . 

The term "provided in a parallel repeat" is intended to 
is indicate that the DNA sequences R 1 and R" flanking M2 are 
sufficiently homologous to allow for homologous recombination 
to take place between the sequences. Preferably, each of the 
sequences R* and R" comprises a substantially identical segment 
of at least 20 nucleotides, more preferably of at least 50 
20 nucleotides, such as 50-100 nucleotides, and even more pre- 
ferably at least 500 (such as 500-1000 nucleotides) and most 
preferably at least 1000 nucleotides, such as 1000-3000 
nucleotides. 

The presence of the target sequences for a site-specific 
25 recombination enzyme (first aspect) or of the parallel repeats 
(second aspect) enables the specific deletion of the marker 
gene M2 from the genome of a host cell which in its genome have 
received the marker gene by transposition. More specifically, 
the DNA constructs according to the first and second aspect of 
30 the invention, respecitively , in which the marker gene M2 is 
located inbetween target sequences for a site-specific recombi- 
nation enzyme or inbetween parallel repeat sequences, is 
particularly useful for the construction of marker-free, 
multicopy strains (i.e. bacterial cells which in their genome 
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comprise multiple copies of an integrated DMA sequence of 
interest) . In short each of the DNA constructs is intended to 
be used in a two-step integration process, wherein, in the 
first step, genomic integration of the fragment IR(1) -P-R-M2-R- 

5 IR(2) or IR(1)-R-M2-R-P-IR(2) (the first aspect) , or IR(1)-P-R'- 
M2-R*'-IR(2) or IR(1)-R'-M2-R"-P-IR(2) (the second aspect), is 
accomplished by transposition and selection for M2*-cells, and, 
in the second step, the selectable marker gene M2 is eliminated 
by a site-specific recombinase enzyme provided in trans and 

10 interacting with the target sequences R flanking the marker 
gene M2 (the first aspect) or by recombination between the 
parallel repeat sequences R' and r" (the second aspect). 

In the present context the term "marker- free" is intended to 
indicate that the cell having integrated multiple copies of a 
15 DNA sequence of interest does not express a selectable marker, 
such as an antibiotic resistance, the presence of which is 
undesired in the cell, which marker is necessary for or 
advantageously used in the construction of the cell, 

DETAILED DESCRIPTION OF THE INVENTION 

20 In the following the various aspects of the invention is 
discussed in further detail. 

The DNA construct of the invention 

In addition to the DNA constructs according to the first and 
second aspect of the invention, which are described above, in 
25 a third aspect, the present invention relates to a DNA con- 
struct comprising the structure IR(1) -R-M2-T-R-P-IR(2) , IR(1)- 
P-R-M2-T-R-IR(2) , IR( 1) -R-T-M2-R-P-IR (2 ) or IR( 1) -P-R-T-M2-R- 
IR(2), wherein 

IR(1) and IR(2) denote transposase target sequences, 
30 P a DNA sequence of interest, 

R a target sequence for a site-specific recombination enzyme, 
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M2 a selectable marker gene, and 
T a transposase gene T. 

As compared to the DNA construct according to the first aspect 
of the invention the DNA construct of the third aspect compri- 
5 ses the transposase gene T inside the structure defined by 
IR(1) and IR(2) rather than outside said structure. 

Analogously, in a fourth aspect the- invention relates to a DNA 
construct comprising the structure IR(1) -R , -M2-T-R' , -P-IR(2) , 
IRflJ-P-R'^-T-R' '-IR(2) , IR( 1) -R' -T-M2-R 1 ■ -P-IR ( 2 ) orlR(l)- 
10 P-R'-T-M2-R l «-IR(2) , wherein 

IR(1) and IR(2) denote transposase target sequences, 
P a DNA sequence of interest, 
R' and R 1 1 parallel repeat sequences, 
M2 a selectable marker gene, and 
is T a transposase gene T. 

The DNA construct of the invention is preferably one, which is 
capable of being transposed in cells of gram-positive bacteria, 
in particular strains of Bacillus . 

Selectable marker 

20 In order to facilitate selection for cells in which the part of 
the DNA construct of the invention located within and compri- 
sing the inverted repeat sequences IR(1) and IR(2) (e.g. the 
structure IR ( 1 ) -P-R-M2 -R-IR ( 2 ) or IR ( 1 ) -R-M2 -R-P-IR ( 2 ) in 
connection with the first aspect of the invention) has been 

25 integrated in the genome, it is advantageous that the DNA 
construct comprises a further selectable marker gene Ml outside 
the structure defined by and comprising IR(1) and IR(2) . In 
this case, the DNA construct comprises a first selectable 
marker gene Ml present outside the structure defined by and 

30 comprising IR(1) and IR(2) and a second marker M2 within said 
structure. Thereby, when Ml is different from M2, it is 
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possible to make a two step selection when the DNA construct of 
the invention is transposed into the genome of a recipient 
cell, the first step selecting for cells in which the DNA con- 
struct has been introduced (Ml*, M2* cells) and a second step 

5 selecting for cells in which the structure defined by IR(1) and 
IR(2) comprising the marker gene M2 has been integrated into 
the genome and the remaining part of the DNA construct (car- 
rying Ml) has been lost (Ml*, M2*) . Subsequently, the marker 
gene M2 may be eliminated from the resulting cells by the 

10 action of a resolvase protein. 

The selectable marker may be a gene coding for a product which 
confers antibiotic resistance to a cell expressing the gene or 
a non-antibiotic marker gene, such as a gene releiving other 
types of growth inhibition, i,e. a marker gene which allow 

15 cells containing the gene to grow under otherwise growth- 
inhibitory conditions. Examples of such genes include a gene 
which confers prototrophy to an auxotrophic strain, e.g. dal 
genes introduced in a dal " strain (cf. B. Diderichsen in 
Bacillus; Molecular Genetics and Biotech nology Applications , 

20 A.T. Ganesan and J. A. Hoch, Eds., Academic Press, 1986, pp. 35- 
46) or a thy gene introduced in a thy— cell (cf. Gryczan and 
Dubnau (1982), Gene, 20, 459-469) or a gene which enables a 
cell harbouring the gene to grow under specific conditions such 
as an amdS gene, the expression of which enables a cell 

25 harbouring the gene to grow on acetamide as the only nitrogen 
or carbon source (e.g. as described in EP 635 574), or a gene 
which confers resistance towards a heavy metal (e.g. arsenite, 
arsenate, antimony, cadmium or organo-mercurial compounds) to 
a cell expressing the gene. Cells surviving under these 

30 conditions will either be cells containing the introduced DNA 
construct in an extrachromosomal state or cells in which the 
above structure has been integrated. Alternatively, the 
selectable marker gene may be one conferring immunity to a cell 
expressing the gene. 
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The DNA sequence of interest 

The DNA sequence of interest present in the DNA construct of 
the invention may be a DNA sequence having or encoding any 
function. For instance, the DNA sequence may comprise a 

5 sequence encoding a structural or regulatory protein, or may 
comprise a regulatory sequence such as a promoter. Alternati- 
vely, the inserted sequence may be one not known to possess any 
biological function, which can be used to interrupt a cellular 
function, e.g. by inserting itself within an essential gene 

io thereby interrupting the function of the gene. The DNA sequence 
of interest may be a gene and thus associated with the neces- 
sary regulatory elements required for its expression including 
a promoter, a terminator or a ribosome binding site. 

As will be understood from the above the present invention is 
15 of particular use in constructing bacterial cells comprising 
multiple copies of a DNA sequence of interest. Such multicopy 
strains are of particular interest for the industrial produc- 
tion of polypeptides of interest and, accordingly, in a highly 
preferred embodiment the DNA sequence of interest encodes a 
20 polypeptide of interest. 

The polypeptide may be a translocated polypeptide, i.e. a 
polypeptide which, when expressed, carries a signal sequence 
which enables it to be translocated across the cell membrane. 
In particular, the translocated polypeptide may be a secreted 
25 polypeptide or a polypeptide involved in the secretory machi- 
nery of the bacterial cell in question. 

The polypeptide, whether secreted or not, may be an enzyme, 
e.g., selected from an amylolytic enzyme, a lipolytic enzyme, 
a proteolytic enzyme, a cellulytic enzyme, an oxidoreductase or 
30 a plant cell-wall degrading enzyme. Examples of such enzymes 
include AMG, amylase, lipase, cutinase, esterase, cellulase, 
hemicellulase, protease, peroxidase, laccase, phenoloxidase, 
catalase, glucose oxidase, phytase, lyase, pectinase, glucosi- 
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dase, mannosidase, isomerase, invertase, trasf erase, ribonucle- 
ase, galactosidase and chitinase. Alternatively, the secreted 
polypeptide may be a hormone, a growth factor, a receptor or 
the like. 

5 A preferred example of a translocated polypeptide of the 
secretory pathway is PrsA (WO 94/19471) which, when overex- 
pressed in cells of Bacillus , has been found to result in an 
increased secretion of a secreted polypeptide of interest from 
such cells. 

10 Transposase and transposase reco gnition sequences 

It will be understood that the transposase gene and the 
transposase target sequences must be chosen so as to be capable 
of functioning together. In the present invention any trans- 
posase recognition sequence may be used together with the 

15 corresponding transposase. Typically, the transposase gene and 
the transposase target sequences are derived from the same 
class of transposons (cf the Background of the Invention 
section above) , and preferably from the same transposon. The 
transposase recognition sequences may be inverted repeat 

20 sequences, e.g. derived from Tnl, Tn2, Tn3, Tn5, Tn9, TnlO and 
Tn903. 

Alternatively, the transposase recognition sequences are free 
from inverted repeats. Examples of such sequences are those 
derived from the transposase recognition sequences of bac- 
25 teriophage Mu and related transposons. When the transposase 
recognition sequences are derived from bacteriophage Mu it may 
be necessary to supply an accessory protein in addition to the 
transposase (cf the Background of the Invention section above) . 

The transposase gene and the transposase target sequences may 
30 be derived from naturally occurring transposons either by being 
isolated from the relevant source by use of standard techniques 
or by being synthesized on the basis of known sequences. It 
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will be understood that functional analogues or derivatives of 
naturally occurring sequences may be used as long as they are 
capable of mediating the transposition. The functional analogu- 
es or derivatives may be prepared synthetically and may differ 
5 from the wild type sequence in one or more nucleotides. 

Resolvase and resolvase target sequences 

It will be understood that the essential feature according to 
the second aspect of the invention is the use of a site- 
specific recombination system to specifically delete the marker 

io gene M2, once transposition into the host genome has taken 
place. Several site-specific recombination systems are known 
(cf the Background of the Invention section above) all of which 
are contemplated to be useful for use in the present invention. 
For the purpose of the present invention, the preferred use is 

15 of a system which consists of only two elements: a site 
specific recombinase enzyme and a target sequence for said 
enzyme . 

Examples of such systems are the pAM01 resolvase having as 
target sequence the pAM01 res sequence (Janniere, L. , Gruss, 

20 A. , Ehrlich, S. D. (1993), Plasmids, pp. 625-644 in Sonenshein, 
A. L., Hoch, J. A., Losick, R. (eds.) Bacillus subtilis and 
other gram-positive bacteria: Biochemistry, Physiology and 
molecular genetics, American society for microbiology, Washing- 
ton D.C.) and the phage PI Cre enzyme having as target sequence 

25 the PI lox site (Hasan, N. , Koob, M. , Szybalski, W. (1994). 
Escherichia coli genome targeting, I. Cre-lox-mediated in vitro 
generation of ori* plasmids and their in vivo chromosomal inte- 
gration and retrieval, Gene, 150, 51-56), 

It will be understood that the DNA sequence encoding the 
30 recombinase enzyme and the target sequence for this enzyme may 
be derived from naturally occurring systems (as described 
above) either by being isolated from the relevant source by use 
of standard techniques or by being synthesized on the basis of 
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known sequences* It will be understood that functional analogu- 
es or derivatives of naturally occurring sequences may be used 
as long as they are capable of functioning in the intended 
manner. The functional analogues or derivatives may be prepared 
5 synthetically and may differ from the wild type sequence in one 
or more nucleotides. 

For some purposes it may be advantageous to introduce the gene 
encoding the recombinase enzyme (when provided in trans) into 
the cell by use of a naturally occurring plasmid, such as 
10 pAM01, comprising the gene. 

The DNA sequence encoding the recombinase enzyme is preferably 
provided in trans. However, it may be advantagous that the DNA 
sequence is present in the vector which carries the DNA 
construct of the invention. In this case the DNA sequence is 

15 preferably located in the part of the vector which is integra- 
ted into the genome of the host cell, and most preferably on 
the part of the DNA which after integration is excised from the 
genome due to the action of the recombinase enzyme. In addi- 
tion, the DNA sequence should be located under the control of 

20 an inducible or regutable promoter such as a temperature 
inducible promoter or a xylose inducible promoter or the SPAC 
promoter, so that the expression from the DNA sequence could be 
controlled to take place only after transposition has taken 
place. If the DNA sequence encoding the recombinase enzyme is 

25 provided in cis and under the control of a regutable promoter 
the method of the invention may be accomplished using only one 
transformation step. 

Furthermore, the gene encoding the site specific recombination 
enzyme may be engineered by methods well known in the art to 
30 allow its proper expression in the host strain. It may by 
transferred onto a temperature-sensitive or otherwise conditio- 
nal replicon, to allow for easy loss of the gene from the host 
cell once the site-specific recombination event has occurred. 
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Homologous recombination in c ombination with a counterselec- 
tabje marker 

Although the selectable marker gene may be deleted by homolo- 
gous recombination between the DNA sequences R 1 and R" when 
5 using the DNA construct according to the second aspect of the 
invention such homologous recombination may occur with a very 
low frequency for which screening may be difficult to achieve 
in actual practice. In order to avoid this practical problem, 
it may be advantageous that the selectable marker gene M2 is a 
io gene which in the host cell of choice can be counterselected. 
In other words, it may be desirable to use a marker, the 
presence as well as the absence of which can be selected. For 
instance, the marker gene may be one conferring prototrophy to 
an auxotrophic cell, e.g. a thv gene for use in a thy 1 - cell 
15 (Gryczan, T.J., and Dubnau, D. (1982), Direct selection of 
recombinant plasmids in Bacillus subtil is . Gene, 20, 459-469), 
or a gene which enables a cell expressing the gene to grow on 
a specific substrate, e.g. amdS, the expression of which allows 
cells to grow on acetamide as the only nitrogen or carbon 
20 source (EP 635 574). When the thv gene is used a selectable 
marker 142 the DNA construct according to the second aspect is 
introduced into a thy" -cell allowing for selection of cells 
having received the construct (first step) and in a second step 
the resulting cells are subjected to growth on a medium which 
25 cannot tolerate the presence of the thy gene, thereby allowing 
for selection of cells having lost the thv gene by homologous 
recombination (the principle of using a thy gene as a counter- 
selectable marker is described in detail by Gryczan and Dubnau 
op. cit. ) . Analogously, the amdS gene may be used in a two step 
30 procedure, wherein in a first step selection is made for amdS- 
containing cells by growth on acetamide as the only carbon or 
nitrogen source, and in a second step counterselection is made 
by growing the cells on a medium containing f louracetamide and 
urea as nitrogen sources (cf EP 635 574) whereby cells are 
35 selected which do not contain the amdS gene. 
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Construction of the DNA co nstruct of the invention 

A DNA construct of the invention may conveniently be con- 
structed on the basis of naturally occurring or otherwise 
existing transposon delivery vectors. The transposon delivery 

5 vectors described in the literature contains a transposase 
gene, if needed modified so as to be expressed in the intended 
host organism, and a transposable cassette, essentially 
containing a resistance gene flanked by sufficient DNA derived 
from the ends (e.g. inverted repeat sequences) of the original 

io transposon to enable the transposition of the resistance gene 
into the host genome. 

In one embodiment the DNA construct of the invention may be 
prepared by modification of such transposon delivery vectors so 
as to contain, within the structure that by transposition 

15 integrates into the host genome, a DNA sequence of interest, 
either replacing or in addition to the antibiotic resistance 
gene. It is further possible to replace the antibiotic re- 
sistance gene with any other selectable marker gene. The 
modification of the transposon delivery vectors may be ac- 

20 complished by methods known in the art. In an alternative em- 
bodiment the DNA construct of the invention is prepared by 
joining each of the isolated elements to be comprised by the 
DNA construct using methods known in the art. 

When the selectable marker gene is to be eliminated from the 
25 cells in which the DNA construct has been integrated the marker 
should be flanked by resolvase target sequences or by parallel- 
ly repeated homologous DNA sequences. These sequences may be 
inserted into the DNA construct prepared as described above by 
methods known in the art. It is important that the gene of 
30 interest is located outside the structure flanked by these 
target sequences. 
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The vector of the invention 

Qriqin off replication 

For use in the present invention it is generally preferred that 
the DNA construct of the invention is linked to an origin of 
5 replication, i.e. present on a vector. In the present context 
the term "vector" is intended to denote a DNA molecule capable 
of functioning as an autonomously replicating extrachromosomal 
element. The vector may, e.g., be a plasmid or a bacteriophage. 

The origin of replication should not be located between the 
10 transposase recognition sequences IR(1) and IR(2) since such 
location would result in the origin of replication being 
integrated into the genome of the host cell. Genomic integra- 
tion of an origin of replication is undesirable from a stabili- 
ty point of view. 

15 Except for the above the location of the origin of replication 
is not critical. Thus, in a DNA construct of the invention the 
origin of replication may be located on either side of "T" and 
"Ml" (when present) . 

If it is desired to improve the efficiency with which cells 
20 harbouring an integrated DNA construct can be isolated, the DNA 
construct may be present on a vector comprising a conditional 
origin of replication. In other words a vector may be used 
which is able to replicate under certain (permissive) condi- 
tions and unable to replicate under other (non-permissive) 
25 conditions. The vector may, for instance, be one which is 
temperature-sensitive for replication. Thus, in an embodiment 
of the method of the invention, the vector comprising the DNA 
construct to be integrated is one which is unable to replicate 
at increased temperatures, which yet permit growth of the host 
30 cell. The bacterial cells are initially cultured at a tempera- 
ture permitting vector replication and subsequently, after 
integration of the DNA construct of the invention, into the 
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bacterial genome may have taken place, cultured at a tempera- 
ture which does not permit vector replication so that the 
vector introduced into the cell is lost from the cells. The 
cultivation at the non-permissive temperature is conducted 
5 under selective conditions to ensure that only cells containing 
the integrated DNA construct will survive. 

Another way of increasing the efficiency of integration and 
subsequent loss of the vector from the cells may be to treat 
the cells transformed with the vector with a plasmid-curing 
io agent, e.g. novobiocin (Gado, I. et al., 1987. Zbl. Bakt. Hyg. 
A. 265, 136-145), after culturing the host cells under selec- 
tive conditions as described above. 

A still further type of a conditional replication origin may be 
one of a limited host-range plasmid, i.e. a plasmid which can 
is only replicate in a limited number of microbial species. For 
the present purpose an origin of replication should be chosen 
which is derived from a limited host-range plasmid which is 
unable to replicate in the recipient cell into which the DNA 
cnstruct is to be transposed. 

20 When the DNA construct of the invention is constructed by 
modification of transposon delivery vectors the origin of 
replication normally present on these vectors may be used. 
Otherwise, the origin of replication may be provided, con- 
veniently by inserting the DNA construct into a vector suitable 

25 for the introduction into the intended host cell. The integra- 
tion of the transposable cassette into the genome of the host 
cell may be accomplished following the procedures described in 
the literature (Petit et al., 1990 op. cit. ) for use of 
transposon delivery vectors for mutagenesis. 

30 Cis-actina coniuaational elements 

It may be desirable to introduce the vector of the invention 
carrying a DNA construct of the invention into the recipient 
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cell in question by means of conjugation (further described in 
the section below entitled "Construction of single or multi- 
copy, optionally marker- free strains"). For this purpose the 
DNA construct or vector comprises a so-called cis-acting DNA 
5 sequence which is required for conjugation to take place. 

Accordingly, when a vector carrying a DNA construct of the 
invention is to be introduced into a recipient cell by con- 
jugation the DNA construct or vector further comprises a cis- 
acting DNA sequence required for transfer of a plasmid compri- 
io sing the DNA construct. 

The cis-acting DNA sequence may conveniently be any DNA 
sequence or DNA site capable of mediating mobilization, a 
preferred example being oriT of the tetracycline resistance 
plasmid pBC16 or of the Staphylococcus aureus kanamycin re- 
is sistance plasmid pUBHO (as described by Sellinger et al., 
Journal of Bacteriology, June 1990, pp. 3290-3297) or a func- 
tional analogue or part of oriT. 

For all types of vectors carrying a DNA construct of the in- 
vention (harbouring any of the structures as defined in 

20 connection with the first, second, third and fourth aspect of 
the invention) the cis-acting sequence may be located on the 
vector, but outside the structure defined by and comprising 
IR(1) and IR(2) , whereby the sequence will be excised together 
with the transposon delivery vector. Thus, the cis-acting DNA 

25 sequence may be located on either side of said structure and of 
the DNA sequence T and the DNA sequence Ml, when present. 

In an preferred embodiment of the first, second, third and 
fourth aspect of the invention, the cis-acting sequence is 
located in the part of the structure located within and compri- 
30 sing the resolvase target sequences R and the parallel repeat 
sequences R 1 and R' 1 , respectively. Thereby, the cis-acting 
sequence may be excised - together with the selectable marker 
gene M2 and optionally T (second and fourth aspect of the 
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invention) - from the genome of the cell in which the DNA 
structure has been integrated. 

Construction of single or multicopy, optiona lly marker-free 

strains 

. 5 As it will be understood the construction of multicopy, marker- 
free production strains may be achieved by use of the above 
described DNA constructs allowing the stepwise integration of 
one or more copies of a DNA sequence of interest into the 
genome of a bacterial cell, which method comprises repeated 
10 steps of transposition followed by site-specific marker dele- 
tion. 

Accordingly, in further aspects the invention relates to 
methods of constructing a bacterial cell of the invention by 
use of the DNA construct according to the first, second, third 
15 and fourth aspect, respectively, as further defined above. 

Accordingly, in a further aspect the invention relates to a 
method of constructing a bacterial cell, which in its genomic 
DNA has integrated more than one copy of a DNA sequence of 
interest, and which is free from a DNA sequence encoding an 
20 undesired selectable marker, which method comprises 

a) introducing a first vector comprising a DNA construct 
according to the first aspect of the invention comprising the 
structure IR(1) -P-R-M2-R-IR(2) or IR(1) -R-M2-R-P-IR(2) in 
association with a tranposase gene T and optionally a selec- 

25 table marker gene Ml into the host cell, in which R denotes a 
target sequence for a site-specific recombination enzyme, 

b) selecting for cells being M2 + and optionally Ml-, which in 
their genome comprises the structure IR(1) -P-R-M2-R-IR(2) or 
IR(1)-R-M2-R-P-IR(2) , 
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c) introducing a second vector comprising a DNA sequence 
encoding a site specific recombinase into the cells selected in 
step b) so as to excise the structure R-M2 or M2-R from the 
genome of the cell whereby cells are obtained having integrated 

5 the structure IR(1) -R-P-IR(2) or IR(1) -P-R-IR(2) , 

d) curing the cells resulting from step c) for the second 
plasmid, and optionally 

e) repeating steps a-d) one or more times to produce bacterial 
cells comprising one or more additional copies of the structure 

10 IR(1)-R-P-IR(2) or IR(1)-P-R-IR(2) . 

In a still further aspect the invention relates to a method of 
constructing a bacterial cell, which in its genomic DNA has 
integrated more than one copy of a DNA sequence of interest, 
and which is free from a DNA sequence encoding an undesired 
15 selectable marker, which method comprises 

a) introducing a first vector comprising a DNA construct 
according to the third aspect of the invention comprising the 
structure IR(1) -R-M2-T-R-P-IR(2) , IR(1) -P-R-M2-T-R-IR(2) , 
IR(1)-R-T-M2-R-P-IR(2) or IR(1) -P-R-T-M2-R-IR(2) optionally in 

20 association with a selectable marker gene Ml into the host 
cell, 

b) selecting for cells being Ml" (if Ml was present on the 
vector) and M2*, which cells in their genome comprises one of 
the structures identified in a) , 

25 c) selecting for cells having an increased number of copies of 
the marker gene M2 , 

d) introducing a second vector comprising a DNA sequence 
encoding a site specific recombinase into the cells selected in 
step b) so as to excise the structure R-M2-T, R-T-M2, M2-T-R or 
30 T-R-M2 from the genome of the cell whereby cells are obtained 
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having integrated the structure IR(1) -R-P-IR(2) or IR(1)-P-R- 
IR(2) , 

e) curing the cells resulting from step d) for the second 
plasmid, and optionally 

5 f) repeating steps a-e one or more times to produce bacterial 
cells comprising one or more additional copies of the structure 
IR(1)-R-P-IR(2) or IR(1) -P-R-IR(2) . 

In a still further aspect the invetion relates to a method for 
constructing a bacterial cell, which in its genomic DNA has 
10 integrated more than one copy of a DNA sequence of interest, 
and which is free from a DNA sequence encoding an undesired 
selectable marker, which method comprises 

a) introducing a first vector comprising a DNA construct 
according to the second aspect of the invention comprising the 

15 structure IR(1) -P-r'-M2-r"-IR(2) or IR(1) -r'-M2-r"-P-IR(2) in 
association with a tranposase gene T and optionally a selec- 
table marker gene Ml into the host cell, in which R' and R" 
denote parallel repeat sequences, 

b) selecting for cells being Ml" (if Ml was present on the 
20 vector) and M2*, which cells in their genome comprises the 

structure IR(1) -P-R'-M2-R"-IR(2) or IR(1) -R'-M2-r"-P-IR(2) , 

c) allowing homologous recombination between the DNA sequences 
R' and R M to take place so as to excise the selectable marker 
gene M2 , whereby cells are obtained having integrated the 

25 structure IR(1)-RVR' l -P~IR(2) or IR(1) -P-R'/R 1 1 -IR(2) (wherein 
R'/R' ' denotes the common recombination sequence), and op- 
tionally 

d) repeating steps a-c one or more times to produce bacterial 
cells comprising one or more additional copies of the DNA 

30 structure IR(1) -R'/R' 1 -P-IR(2) or IR(1) -P-R'/R' 1 ~IR(2) . 
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In a still further aspect the invention relates to a method for 
constructing a bacterial cell, which in its genomic DNA has 
integrated more than one copy of a DNA sequence of interest , 
and which is free from a DNA sequence encoding an undesired 
5 selectable marker , which method comprises 

a) introducing a first vector comprising a DNA construct 
according to the fourth aspect of the invention comprising the 
structurelRflJ-R'-MZ-T-R' «-P-IR(2) , IR(1)-P-R , -M2-T-R' f -IR(2) , 
IRflJ-R'-T^-R' '-P-IR(2) or IR(1)-P-R , -T-M2-R' '-IR(2) op- 

10 tionally in association with a selectable marker gene Ml into 
the host cell, in which R 1 and R" denote parallel repeat sequen- 
ces, 

b) selecting for cells being Ml" (if Ml was present on the 
vector) and M2 + , which in their genome comprises the relevant 

15 structure identified in a) , 

c) selecting for cells having an increased number of copies of 
the selectable marker gene M2, 

d) allowing homologous recombination between the DNA sequences 
R 1 and R" to take place so as to excise the selectable marker 

20 gene M2 and the transposase gene T, whereby cells are obtained 
having integrated the structure IR(1) -R'/R 1 , -P-IR(2) or IR(1)- 
P-R'/R 1 '-IR(2) (wherein R'/R" denotes the common recombination 
sequence) , and optionally 

e) repeating steps a-d one or more times to produce bacterial 
25 cells comprising one or more additional copies of the DNA 

structure IRflJ-R'/R 1 f -P-IR(2) or IR(1)-P-R'/R f 1 ~IR(2) • 

It will be noted that integration of the DNA construct accor- 
ding to the first and third aspect, respectively, results in 
the same host cell harbouring the structure IR(1) -R-P-IR(2) or 
30 IR(1)-P-R-IR(2) and that integration of the DNA construct 
according to the second and fourth aspect, respectively, 
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results in the same host cell harbouring the structure IR(1)- 
R'/R f , -P-IR(2) or IR(1)-P-R«/R f '-IR(2) . 

In step a) of any of the above methods, the vector may be 
introduced into the bacterial cell by any method suitable for 

5 the bacterial cell in question. A variety of methods for 
introduction of DNA into bacterial cells are well known in the 
art. These include the use of competent cells (by chemical 
treatment, or "natural" competence) , protoplast trans formation , 
conjugation, electroporation, transduction, or ballistic 

10 transformation. While conjugation may be accomplished by use of 
well-known self-transmissible plasmids as cloning vectors (e.g. 
the plasmid pLS20) , the following specific conjugation method 
has been found to be generally applicable for introduction of 
DNA into cells, e.g. of Bacillus sp., which cannot or only with 

15 difficulty receive DNA by conventional methods such as pro- 
toplast fusion and the like. 

More specifically, conjugation is conveniently achieved by a 
method, in which a population of bacterial donor cells har- 
bouring i) a plasmid comprising a DNA construct of the in- 

20 vention and at least one cis-acting DNA sequence required for 
the transfer of said plasmid by conjugation in the presence of 
a trans-acting mobilizing element, and ii) at least one DNA 
sequence encoding said trans-acting mobilizing element, and a 
population of recipient cells are mixed under conditions 

25 allowing the plasmid to be transferred from the population of 
the donor cells to the population of recipient cells by con- 
jugation. The population of donor cells may, e.g., be a 
population of cells of E. coli or a Bacillus sp., examples of 
which are mentioned further below. The recipient cells are 

30 preferably cells of a Bacillus sp. such as the ones mentioned 
further below. Of particular interest is recipient cells of B. 
licheniformis . B. amvloliquefaciens and B. lentus and non- 
competent cells of B. subtilis . The cis-acting DNA sequence and 
preferred locations thereof in the DNA construct or vector of 
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the invention are described in the section above entitled "Cis- 
acting conjugational elements". 

The term "trans-acting mobilizing element" in intended to 
indicate a protein mediating conjugative transfer of DNA 

5 sequences containing the cis-acting DNA sequence defined above. 
The trans-acting mobilizing element may be a protein encoded by 
a conjugational plasmid, such as the Bacillus sp. plasmid pLS20 
(Koehler and Thorne, Journal of Bacteriology, Nov. 1987, pp. 
5771-5278), or a part or derivative thereof, or may be a 

10 protein encoded by a DNA sequence such as orf-fl of the plasmids 
pBC16 or pUBHO (Sellinger et al., Journal of Bacteriology, 
June 1990, pp. 3290-3297) or a functional analogue or part 
thereof. It will be understood that since the mobilizing 
element is acting in trans it may be encoded by DNA present in 

15 the genome of the donor cell or on a second plasmid present in 
said donor cell. 

The mixing of the cells is conveniently performed by mixing the 
donor and recipient cells and allow them to stand for at least 
4 hours at 30-37 'C and selecting for recipient cells having 
20 received the DNA construct. See Selinger et al., 1990, for 
further reference. 

The cultivation conditions to be used in step b) depend, inter 
alia, on the kind of selectable marker. For instance, when the 
selectable marker encoded by M2 is an antibiotic resistance the 
25 cultivation of b) is performed in the presence of a suitable 
dosage of the antibiotic in question so as to select cells 
having received and expressing M2 . 

Selection and screening are considerably facilitated by the 
presence of the antibiotic resistance markers Ml and M2 on the 
30 DNA construct which is introduced into the recepient host cell. 
The marker Ml being present on the "vector part" of the DNA 
construct, i.e. the part of the construct outside the structure 
defined by IR(1) and IR(2) , whereas the marker gene M2 is 
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present within said structure. However, it may be noted that 
succesfull transposition can be obtained also when the DNA 
construct comprises only one selectable marker, i.e. either Ml 
or M2. 

5 Following a suitable growth regime, e.g. as given in the li- 
terature for pHV1248 (Petit, M.-A., Bruand, C, Janniere, L. , 
Ehrlich, S. D. (1990), TnlO-derived transposons active in Ba- 
cillus subtilis, J. Bacteriol., 172, 6736-6740), strains con- 
taining M2 are selected on plates containing the proper antibi- 

10 otic, and screened by replica plating for the absence of Ml. 
Such strains no longer harbour the transposon delivery vector, 
and has the transposable structure integrated into their 
genome. Integration of the DNA construct by transposition may 
occur at random positions throughout the genome of the host 

15 cell. Thus, when multicopy cells are produced the integrated 
copies will be located at random, separate positions of the 
cell which contributes to increasing the genetic stability of 
the cell. 

The presence of P may be evaluated by methods known in the art, 
20 e.g., as appropriate, by southern analysis, PCR amplification, 
or phenotypic expression of P. When the DNA construct to be 
introduced into the cell harbours a selectable marker gene Ml 
the loss of the Ml phenotype is used to indicate loss of the 
vector part of the introduced DNA once transposition has taken 
25 place. 

When the DNA construct according to the first or third aspect 
of the invention is used (i.e. a DNA construction comprising 
target sequences for a site-specific recombinase enzyme) a 
subsequent step is to introduce a second vector expressing the 
30 site-specific recombination enzyme cognate to the target sequ- 
ences flanking M2 into the strain having recieved the first 
vector. The activity of the recombination enzyme recombines the 
two target sequences, which in the genome flanks M2 and 
optionally T, thus deleting this marker gene and optionally T, 
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an event which is easily detected by screening, e.g. by replica 
plating. 

Once M2-free strains are obtained, these are cured for the 
plasmid expressing the recombination enzyme, e.g. by propaga- 
5 tion at non-permissive temperature if the vector is a tempera- 
ture-sensitive replicon. Alternatively, the vector is allowed 
a certain time of residence in the host, which is then cured 
for the vector, and the plasmid free cells screened for loss of 
M2. 

io The result of this process is a strain that contains one trans- 
posed copy of the gene of interest, but no marker gene. 
This strain is now used as a host strain in a new round of 
transposition and marker deletion, in which it is possible to 
employ exactly the same modified transposon delivery vector and 

is recombination enzyme encoding vector as in the first round, 
i.e. 

i) the modified transposon delivery vector is introduced into 
the host strain, 

ii) strains containing a transposed copy of the DNA sequence of 
20 interest plus the marker gene M2, and free of transposon deli- 
very vector, are obtained as described above, 

iii) the vector expressing the cognate site specific recombi- 
nation enzyme is introduced into these strains (in case of the 
DNA construct according to the third aspect) , and 

25 iv) strains from which the marker gene M2 and optionally T is 
deleted due to the activity of the recombination enzyme (when 
constructed from a DNA construct according to the first or 
third aspect) or homologous recombination (when constructed 
from a DNA construct according to the second or fourth aspect) 

30 are isolated as above, and plasmid free versions of such 
strains obtained. 
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The result of this is now a strain containing two transposed 

copies of the gene of interest, but no marker gene* 

This can again serve as host strain in a new round of trans- 
position and marker deletion, and the process can be repeated 
5 essentially an infinite number of times. 

If the DNA sequence encoding the recombinase enzyme is provided 
in cis under a regulatable promoter (as discussed above) , only 
one transformation step need to be done in each round. In 
practice when transposition has been accomplished the regula- 
10 table promoter is switched on and the recombinase enzyme 
expressed* 

Marker deletion by the site-specific recombination enzyme lea- 
ves one copy of the cognate target sequence in the chromosome. 
Subsequent transposition events brings in new copies of this 

15 target sequence. As transposition can occur to a substantial 
number of sites within the genome, approaching random integra- 
tion, the new target sequences will with high probability be 
situated at a considerable distance from the first one. It is 
likely that the site-specific recombination enzyme may function 

20 less efficiently on very separated sequences. It is furthermore 
likely that the deletion of a substantial amount of genomic 
DNA, which would result from such a recombination event, would 
be lethal, so strains where this has taken place would not be 
isolated by the described procedure. 

25 An analogous process is carried out in connection with strains 
produced by use of a DNA construct according to the second or 
fourth aspect of the invention. In this case the excision of 
the marker gene M2 and, when relevant T, is obtained by 
homologous recombination between the sequences R 1 and R' 1 . 
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The cell of the invention 

In a further aspect the invention relates to a bacterial cell, 
which in its genome has integrated more than one copy of a DNA 
construct comprising the following structure IR(1) -P-IR(2) , in 
5 which 

IR(1) and IR(2) denote transposase target sequences, and 
P a DNA sequence of interest, wherein the structure IR(1)-P- 
IR(2) does not encode an undesired selectable marker. The 
structure further comprises a recombination target sequence R 
10 or the common homologous recombination sequence R7R" inbet- 
ween IR(1) and IR(2) . 

In the present context the term "genome" is intended to 
indicate the constituent DNA of the cell comprising the 
chromosome and stably inherited extrachromosomal elements. 

15 In the present context the cell into which the DNA construct is 
to be integrated is denoted host cell, host strain, recipient 
strain or cell, modified cell or the like. It will be under- 
stood that these terms are used interchangeably. 

The presence of IR(1) and IR(2) in the genome of the cell is 
20 not essential for the function of the cell, but is evidence to 
the fact that the cell has been constructed by transposition, 
i.e. by use of transposons. Thus, only cells having been 
modified by transposition comprise the above structure. If 
necessary, the IR(1) and/or IR(2) may be deleted or inactivated 
25 from the cell by conventional means. 

In a further aspect the invention relates to a bacterial cell, 
which in its genome has integrated at least two copies of a DNA 
construct comprising the following structure IR(1) -P-IR(2) , in 
which 

30 IR(1) and IR(2) denote transposase target sequences, and 
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P a DNA sequence of interest, the structure further comprising 
a recombination target sequence R or the common homologous 
recombination sequence R'/R" inbetween IR(1) and IR(2) . 

It is presently believed that the present disclosure is the 
5 first description of the use of transposition for construction 
of multicopy grampositive cells, in particular Bacillus cells. 
The presence of multiple, normally randomly located copies of 
the structure IR(1) -P-IR(2) further comprising R or R'/R 11 in 
the genome of a cell shows that the cell has been constructed 
io by transposition. 

From the above disclosure it will be apparent that the present 
invention is a very convenient and efficient method of con- 
structing multicopy gram-positive bacterial strains, i.e. bac- 
terial cells which integrated in their genome comprises 

15 multiple copies of a DNA sequence of interest. The cells 
produced according to the present invention are either a) cells 
which in their genome has integrated at least two copies of a 
DNA construct comprising the structure IR(1) -P-IR(2) , the 
structure further comprising a recombination target sequence R 

20 or the common homologous recombination sequence R'/R 1 1 inbet- 
ween IR(1) and IR(2), which structure is free from any gene 
encoding an undesirable, e.g. antibiotic resistance marker. 

Of particular interest is a cell which in its genome (also 
outside the structure IR(1)-P-IR(2) further comprising a DNA 

25 sequence R or R'/R 11 ) is free from a gene encoding an antibi- 
otic resistance marker or another undesired marker type. The 
presence of such genes normally follows when multicopy strains 
are constructed by conventional methods, including trans- 
position when based on transposition delivery vectors known in 

30 the art. 

Although the cell of the invention is preferably free from any 
introduced selectable marker it may contain a gene encoding a 
harmless (i.e. not undesirable) selectable marker such as a a 
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growth-inhibiting marker or an antibiotic-resistance marker • 
These types of markers are discussed in detail above in the 
section entitled "Selection marker". 

The cell of the invention may be of any gram-positive bacterial 
5 species, in particular a cell of a Bacillus sp. or a Lac- 
tobacillus sp. Examples of suitable cells of Bacillus may be 
selected from the group consisting of Bacillus subtilis . 
Bacillus lichenif ormis , Bacillus lentus . Bacillus brevis . 
Bacillus stearothermophilus . Bacillus alkalophilus . Bacillus 
10 amvloliouef aciens . Bacillus coaaulans . Bacillus circulans . 
Bacillus lautus . Bacillus meqaterium . Bacillus thurinaiensis . 
In connection with the use of B. lentus it may be noted that 
there has been no prior disclosure or indication to the fact 
that transposition can work in cells of of said species. 

15 As to the DNA sequence of interest present in the cell of the 
invention reference is made to the section "DNA sequence of 
interest" discussed above. 

Finally, it will be understood that under circumstances where 
there is no desire of constructing a marker-free, single or 

20 multicopy strain M2 need not be eliminated after having served 
the purpose of identifying cells which in their genome comprise 
one or more copies of the DNA sequence of interest. In such 
cases transposition serves as a convenient method of introdu- 
cing at least one and preferably multiple copies of a DNA 

25 sequence of interest into the genome of a cell. 

Production of a polypeptide of interest 

The cell of the invention comprising more than one copy of the 
structure IR(1) -P-IR(2) further comprising a DNA sequence R or 
R'/R' 1 inbetween IR(1) and IR(2) integrated into its genome or 
30 a cell prepared by a method of the invention as described above 
may suitably be used in a method for producing a polypeptide of 
interest which is encoded by a DNA sequence P of interest. 
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The method comprises cultivating the cells in question in a 
suitable nutrient medium under conditions permitting the 
expression of the polypeptide , after which the resulting 
polypeptide is recovered from the culture. 

5 The medium used to culture the cells may be any conventional 
medium suitable for growing the cells, such as minimal or 
complex media containing appropriate supplements. Suitable 
media are available from commercial suppliers or may be 
prepared according to published recipes (e.g. in catalogues of 

10 the American Type Culture Collection) . The polypeptide produced 
by the cells may then be recovered from the culture medium by 
conventional procedures including separating the cells from the 
medium by centrifugation or filtration, precipitating the 
proteinaceous components of the supernatant or filtrate by 

15 means of a salt, e.g. ammonium sulphate, purification by a 
variety of chromatographic procedures, e.g. ion exchange 
chromatography, gel filtration chromatography, affinity chroma- 
tography, or the like, dependent on the type of polypeptide in 
question. 

20 The polypeptide is preferably a translocated polypeptide, in 
particular a secreted polypeptide such as a secreted enzyme. 
Alternatively, the translocated polypeptide is PrsA. Specific 
examples of polypeptides which may be produced in accordance 
with this aspect of the invention are given above in the 

25 section entitled "The DNA sequence of interest". 

Conclusion 

From the above description of the invention it will be apparent 
that the present invention is novel and inventive over the 
prior art for one or more of the following reasons: 

30 1) transposons are used for insertion of more than one copy and 
a fixed, predetermined number of copies of a gene encoding a 
product of interest, 
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2) the transposed DNA contains elements that allow site-speci- 
fic deletion of parts of the transposed DNA, 

3) the resulting strains contain transposed DNA, but have no 
selection marker genes within the transposed DNA, and/or 

5 4) transposons are used in an iterative process, made possible 
by the combination with a method to specifically delete marker 
genes . 

From an industrial point of view, bacterial strains constructed 
in accordance with the invention may have the triple advantage 
io of being 

i) high-yielding due to the presence of multiple gene copies, 

ii) Genetically extremely stable, as the multiple copies are 
not tandem repeats but can be scattered over the chromosome, 
and 

15 iii) Marker free, making them more "environmentally friendly" 
than traditional recombinant production strains containing 
antibiotic resistance genes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is further described in the following examples 
20 with reference to the appended drawings, in which the following 
abbreviations are used: 

" bla " indicates the beta-lactamase gene, encoding ampicillin 
resistance, from pUC19. 

" erm " indicates the erythromycin resistance gene of pE194« 
25 " cat " indicates the chloramphenicol resistance gene derived 
from pC194. 

M IR" indicates the transposon inverted repeat sequence derived 
from TnlO. 

" Tn ase " indicates the transposase from TnlO, modified to be 
30 expressed in Bacillus . 

" PamvO-sav " indicates the Savinase gene, expressed from the 
promoter for the B. arovloliouef aciens alpha-amylase gene. 
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" oriTfpUBHO) " indicates the cis-acting sequence of pUBHO 
necessary for mobilization. 

" ermR " indicates the erythromycin resistance gene derived from 
pAM/Jl. 

5 " repE " indicates the replication protein gene of pAM01. 

" resB " indicates the resolvase gene of pAM/91. 

" topB " indicates the topoisomerase gene of pAM/91. 

" amv 1 " indicates the 5 end of the B. subtil is amylase gene. 

" lacZ " indicates the beta-galactosidase gene from E. coli . 
10 " res " indicates the target site for the resolvase from pAM01. 

" spc " indicates the spectinomycin resistance gene from Tn554. 

" 1 amy " indicates the 3* end of the B. subtilis amylase gene. 

" + ori pUBHO " indicates the replication origin of pUBHO (the 

nick site) . 

15 "rep" indicates the replication protein gene of pUBHO. 
" kan " indicates the kanamycin resistance gene of pUBHO. 
" PamvL " indicates the promoter of the B. lichenif ormis alpha- 
amylase gene. 

" kan' " indicates the 5 end of the kanamycin resistance gene of 
20 pUBHO. 

" ! amyL " indicates the 3 end of the B. lichenif ormis alpha-amy- 
lase gene. 

" repF " indicates the replication protein gene of pE194 . 
" Plac " indicates the beta-galactosidase promoter on pUC19. 
25 " amvL " indicates the alpha-amylase gene from B. licheniformis. 

Fig. 1 is a restriction map of plasmid pMOL553; 

Fig. 2 is a restriction map of plasmid pSJ3282; 

Fig. 3 is a restriction map of plasmid pWT; 

Fig. 4 is a restriction map of plasmid pMAP29; 
30 Fig. 5 is a restriction map of plasmid pSJ3157; 

Fig. 6 is a restriction map of plasmid pSJ2739; 

Fig. 7 is a restriction map of plasmids pSJ3279-pSJ3281; 

Fig. 8 is a schematic representation of the oligonucleotide 

primers and amplified DNA fragments used for the construction 
35 of the res - cat - res -IR segment eventually incorporated into 

plasmids pSJ3389-pSJ3390 ; 
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Fig. 9 is a restriction map of plasmids pSJ3 372-pSJ3376 ; 
Fig. 10 is a restriction map of plasmids pSJ3389-pSJ3390 ; 
Fig. 11A shows the result of a Mancini immunodiffusion assay 
for Savinase; 

5 Fig. 11B shows the result of a Mancini immunodiffusion assay 
for Savinase; 

Fig. 12A is a Southern blot, using DNA from the strains indica- 
ted; 

Fig. 12B is a Southern blot, using DNA from the strains as in 



io Fig. 


12A; 
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Fig. 


20 


is 


a 


restriction 


map 


of 


plasmids pSJ3444 


-pSJ3445; 


Fig. 


21 


is 


a 


restriction 


map 


of 
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plasmids pSJ3524- 


-psJ3527. 



The invention is further illustrated in the following examples 
which are not in any way intended to limit the scope of the 
30 invention as claimed. 

MATERIALS AND METHODS 

In vitro DNA work, transformation of bacterial strains etc. 
were performed using standard methods of molecular biology 
(Maniatis, T. , Fritsch, E. F. , Sambrook, J. "Molecular Cloning. 
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A laboratory manual". Cold Spring Harbor Laboratories, 1982; 
Ausubel, F. M. , et al. (eds.) "Current Protocols in Molecular 
Biology". John Wiley and Sons, 1995; Harwood, C. R. , and 
Cutting, S. M. (eds.) "Molecular Biological Methods for 
5 Bacillus". John Wiley and Sons, 1990). Enzymes for DNA manipu- 
lations were used according to the specifications of the 
suppliers. 

Media used (TY, BPX and LB agar) have been described in EP 0 
506 780. LBPSG agar is LB agar supplemented with phosphate 
io (0.01 M KjPOJ , glucose (0.4 %) , and starch (0.5 %) 

EXAMPLE 1 

Construction of a transposon delivery vector for transposition 
of a Savinase* gene 

Transposon donor plasmid pHV1248 (Petit, M.-A., Bruand, C. , 
15 Janniere, L. , Ehrlich, S. D. (1990) TnlO-derived transposons 
active in Bacillus subtilis . J. Bacteriol., 172, 6736-6740) was 
used as starting material. This plasmid contain the pE194 
replicon which is thermosensitive for replication, and carry a 
transposase gene from TnlO modified to be expressed in Bacillus 
20 subtilis . and sufficient sequences from the IS10 elements of 
TnlO flanking a chloramphenicol resistance gene (mini-TnlO) to 
allow transposition of mini-TnlO into the Bacillus subtilis 
chromosome. pHV1248 was introduced into E. coli SJ6 (Diderich- 
sen et al., 1990, J. Bacteriol. 172, 4315-4321), selecting 
25 ampicillin resistance (100 ng/ml) , to give SJ1609. Strain 
SJ1609 was deposited with DSM as DSM10445, as a patent deposit 
in accordance with the Budapest treaty, on December 22, 1995. 

Savinase* is the extracellular, alkaline protease from Bacillus 
lentus . A vector for the delivery of this gene by transposition 
30 is pMOL553. This plasmid was constructed in two steps. 
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Step 1. 

SOE PCR (Horton, R. M. , Hunt, H. D. , Ho, S. N. , Pullen, J . K. , 
Pease, L. R. (1989) Engineering hybrid genes without the use of 
restriction enzymes: gene splicing by overlap extension. Gene, 

5 77, 61-68) was used to insert a BamHI site upstream of the cat 
gene in pHV1248. Two seperate PCR reactions were performed 
using pHV1248 as template. In the first PCR reaction was used 
primer 1 (LWN5037) : CCCACT£G^TC£AATTTTCGTTTGTTG and primer 2 
(LWN5038: GCAAATTGATCCAAGAGAACCAAC . The underlined bases in 

io primer 1 show the position of the BamHI site. The second PCR 
reaction was based on primer 3 (LWN5036) : CAACAAACGAAAATTGGATC- 
CAGTGGG and primer 4 (LWN5039) : GCACATCATCATCATAAGC. Both PCR 
reactions were performed by standard procedures using tempera- 
tures of 96°C at denaturation, 55°C at annealing and 72 °C at 

15 the extension step. A total of 20 cycles were performed. Both 
fragments were purified from an agarose gel and 500 ng of each 
were used for a second 5 cycle PCR reaction: 96°C for 2 min. , 
50 °C for 5 min and 72 "C for 1 min. Primer 2 and primer 4 (100 
pmol) were added at 96 °C and a third 25 cycle PCR reaction was 

20 initiated: 96°C for 30 sec, 55*C for 30 sec. and 72°C for 90 
sec. The final PCR fragment of 1330 bp was digested with Hin- 
dlll, ligated to Hindlll digested pHV1248, and the ligation 
mixture transformed into E. coli SJ2 (Diderichsen et al., 1990, 
J. Bacterid. 172, 4315-4321). One transformant kept, MOL612, 

25 contained plasmid pMOL610. The position of the BamHI site in 
PMOL610 was verified by restriction digest. 

Step 2 

In this step the entire Savinase® gene was cloned into the Bam- 
HI site of pMOL610. The Savinase* gene and a promoter region 

30 was amplified from plasmid pSX222 (WO 92/11357) by PCR using 
primers with BamHI restriction sites (underlined) , primer 5 
(LWN5136) : CCGGCGGATCCAAGGGGTGATCG and primer 6 (LWN2043) : GGG- 
GTACTAGTAACCCGGGCCCGGCGTAGA GGATCC ATACACAAA . This Savinase* 
gene encodes the wild type Savinase* enzyme, but the DNA 

35 sequence is modified to contain a number of restriction enzyme 
sites within the Savinase* coding sequence. The PCR reaction 
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was performed as follows: 96°C for 30 sec, 55°C for 30 sec, 
and 72 *C for 120 sec. After 20 cycles the PCR fragment was 
BamHI digested, purified and cloned into the BamHI site of 
PMOL610. The ligation mixture was transformed into E. coli SJ2, 
sand one transformant kept as MOL553 (SJ2 / pM0L553) . The 
cloning was verified by restriction digests and a distinct 
protease phenotype in B. subtil is (e.g. in strain DN1885 (Di- 
derichsen et al. # 1990, J. Bacteriol. 172, 4315-4321), or 
protease-def icient derivatives of this strain) . 

io The transposon delivery vector encoding Savinase* thus constru- 
cted is pMOL553. The full sequence of this plasmid is given as 
SEQUENCE ID no. i, and a restriction map in figure 1. 

EXAMPLE 2 

Transposition of a Savinase* gene into the B. lentus chromo- 
15 some, using selection for chloramphenicol resistance 

The transposon delivery plasmid pMOL553 was used to insert 
extra protease genes randomly in the chromosome of B. lentus . 

A mutant of the alkalophile Bacillus lentus strain C360 (CA 
954807) was transformed with the plasmid pM0L553 by protoplast 

20 transformation according to a method almost identical to the 
HCP-1,5 overlayer procedure described by Akamatsu (Akamatsu, 
T. et al. (1984), Agric. Biol. Chem. 48(3), 651-655). The only 
major difference was that pH of the HCP-1,5 media and agar was 
changed to pH 9 to allow the alkalophilic bacilli to regenerate 

25 under more suitable conditions. The regeneration plates 
containing either 10 /ig/ml chloramphenicol or 5 Mg/ml erythro- 
mycin were incubated from 5 to 10 days at 3 0°C. 

After reisolating the transf ormants on LB9 plates (LB medium 
containing 0.05 M NaHC0 3 buffer pH 9.0) with 2 Mg/ml erythromy- 
30 cin, the mini TnlO transposon carrying the protease gene as 
well as the cat gene was allowed to transpose according to the 
method described by Petit et al., 1990. After overnight incuba- 
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tion in liquid medium at 46 *C, cultures were plated on LB9 + 
6Mg/ml chloramphenicol at 46 *C and colonies screened for eryth- 
romycin sensitivity. Eleven erythromycin sensitive, chloramphe- 
nicol resistent colonies were selected. A Southern analysis of 

5 the chromosomal DNA isolated from these 11 colonies revealed 
that each contained a mini transposon inserted in the chromoso- 
me in one copy. 10 strains contained identical insertions, 
whereas one contained an insertion in a different position. The 
10 identical strains most likely represent siblings from an 

io early transposition event. None of the transposons had inserted 
into the gene coding for the native protease situated in the B. 
lentus chromosome . 

EXAMPLE 3 

Transposition of a Savinase* gene into the Bacillus lentus 
15 chromosome, without selection for chloramphenicol resistance 

In an experiment almost identical to that of example 2, a JL. 
lentus strain with a chromosomal deletion in the alkaline 
protease gene was transformed with pM0L553 by the above mentio- 
ned protoplast procedure. Throughout this experiment the se- 

20 lection pressure was only for erythromycin resistance, con- 
ferred by the vector part of pM0L553. The idea was to isolate 
cells that had gained an active protease gene by transposition 
of the protease-carrying mini transposon into the chromosome, 
by screening for the ability to degrade skimmed milk on an 

25 LB (9) agar plate. The protoplast transformation and reisolation 
steps were performed with erythromycin selection only. The 
integration period at 30 °C allowing the transposon to integrate 
was one day in LB (9) medium with 2/ig/ml erythromycin. After one 
day in LB (9) medium without selection pressure at 46 °C the 

30 cells were spread on LB (9) plates with 1% skimmed milk, but no 
antibiotics. Among 600 colonies 59 were protease positive. 
These 59 colonies were subsequently analyzed with respect to 
their antibiotic resistance phenotypes. 19 colonies were re- 
sistant to both erythromycin and chloramphenicol indicating 

35 that they still contained pMOL553, while 40 colonies were re- 
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sistant to chloramphenicol only, indicating that they had ari- 
sen through transposition of the cat and Savinase* genes onto 
the chromosome. 

Two of the chloramphenicol resistant and erythromycin sensitive 
5 colonies were analyzed by Southern hybridization. Both cells 
contained two copies of the protease carrying mini transposon 
situated in the chromosome at locations different from the 
location of the (in this case partly deleted) native protease 
gene. All four mini transposon integrations were at different 
10 positions in the B. lentus chromosome. 

This example thus illustrates the construction of a strain 
containing multiple, transposed copies of a sequence of inter- 
est, encoding a translocated polypeptide, and illustrates the 
isolation of strains containing transposon insertions, without 
15 the use of a selectable marker in the strain isolation. 

EXAMPLE 4 

Construction pf 3 dpnor strain for conjugative transfer of 

plfrgpids containing oriT from pvpiip 

Plasmids pLS20 and pBC16 can be transferred by conjugation from 
20 B. subtilis strain PSL1 UM13 into various Bacillus recipient 
strains (Koehler, T. M. and Thorne, C. B. (1987). Bacillus 
subtilis f natto ) plasmid pLS20 mediates interspecies plasmid 
transfer. J. Bacterid., 169, 5271-5278). 

DN1280 is a derivative of B. subtilis 168, containing a dele- 
25 tion in the dal gene (Diderichsen, B. (1986) . A genetic system 
for stabilization of cloned genes in Bacillus subtilis , p35-46. 
In A. T. Ganesan and J. A. Hoch (eds. ), Bacillus molecular ge- 
netics and biotechnology applications. Academic Press, Inc., 
New York) . DN1280 was rendered competent and transformed with 
30 plasmid pHV1248, selecting for erythromycin resistance (5 
jig/ml) at 30 *C. The resulting strain was used as recipient in 
conjugation with PSL1 UM13. Both strains, taken from plates 
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incubated overnight, were mixed on an LB PS 6 plate (an LB plate 
supplemented with phosphate (0.01 M K3POJ , glucose (0.4 %) , and 
starch (0.5 %)) with D-alanine (100 /ig/ml) , and incubated for 
5 hours at 30*C. The plate was then replicated onto an LBPSG 
5 plate as above, but in addition containing erythromycin (5 
Mg/ml) and tetracycline (5 ng/ml) . Single colonies appearing on 
the replica plate was assayed for their ability to transfer 
pBC16 into B. subtil is DN1885. Conjugation was performed by 
mixing of the strains on LBPSG plates as above and incubation 
10 for 5 hours at 30 *C. Replication was to LBPSG plates with 
tetracycline (5 jxg/ml) , but without D-alanine. 

The omission of D-alanine effectively counterselects the dal- 
donor strain. A few of the colonies assayed were able to 
transfer the Tet R marker into DN1885. This indicates that these 

15 colonies all harbour pLS20 in addition to pBC16. One such 
colony was propagated at 50 'C in liquid TY medium containing 
tetracycline (5 fig/ml) and D-alanine (100 Mg/ml) , subsequently 
plated on LB containing tetracycline (5 tig/ml) and D-alanine 
(100 jxg/ml), and replica plated onto LB containing D-alanine 

20 (100 jig/ml) and erythromycin (5 jxg/ml) or chloramphenicol (6 
Mg/ml), respectively. An tetracycline resistant, erythromycin 
and chloramphenicol sensitive isolate was kept as PP289-5. 
This strain, which is dal- and contains pLS20 and pBC16, can 
serve as a conjugation donor strain that allows the transfer of 

25 plasmids containing the pUBHO oriT into various recipient 
strains. 

EXAMPLE 5 

Constructio n of a mobilizable transposon delivery vector for 
transposition of a Savinase* gene 

30 Mobilization of plasmid pUBHO by pLS20 or its derivatives has 
been described and analyzed in some details (Koehler, T. M. and 
Thorne, C. B. (1987). Bacillus subtilis (natto) plasmid pLS20 
mediates interspecies plasmid transfer. J. Bacterid., 169, 
5271-5278; Selinger, L. B. , McGregor, N. F. , Khachatourians, G. 
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G. and Hynes, M. F. (1990). Mobilization of closely related 
plasmids pUBHO and pBC16 by Bacillus plasmid pXO503 requires 
trans-acting open reading frame 0. J. Bacteriol., 172, 3290- 
3297). In this invention elements from these plasmids were used 
5 to mobilize transposon delivery vectors. 

Mobilization of pUBHO is dependent on a cis acting region 
(oriT) located 5 f to orf)9 (Selinger et al.,1990). A 555 bp seg- 
ment from pUBHO, extending from pos. 1020 to pos. 1575 in the 
pUBHO sequence (McKenzie, T., Hoshino, T. , Tanaka, T. , Sueoka, 
io N. (1986) The nucleotide sequence of pUBHO: some salient 
features in relation to replication and its regulation. Plasmid 
15, 93-103) , was PCR amplified using primers LWN5232 and 
LWN5233. 
LWN5232: 

15 5 1 -GTCGGAGCTCATTATTAATCTGTTCAGCAATCGGGC-3 1 
LWN5233: 

5 1 -GTCGGAGCTCTGCCTTTTAGTCCAGCTGATTTCAC-3 1 

The amplified fragment was digested with Sad and initially 
cloned into the Sad site of an E. coli plasmid (a pUC19 der- 
20 ivative) . The fragment was subsequently excised again using 
Sad, and cloned into the unique SacI site in the previously 
described plasmid pMOL553. The ligation mixture was transformed 
into E. coli SJ2 selecting for ampicillin resistance (100 
/xg/ml) . 

25 Plasmids were prepared from the pooled transf ormants, and the 
plasmid mixture transf ormed into PP289-5, selecting resistance 
to tetracycline (5 /xg/ml) » chloramphenicol (6 Mg/ml) and eryt- 
hromycin (5 jug/ml) on LBPSG plates containing D-alanine (100 
/xg/ml) • The transf ormants were again pooled, and the pool used 

30 to transfer the oriT-derivative of pM0L553 into DN1885 by 
conjugation, using the method described above in EXAMPLE 4. 
Finally, the identity of the plasmids in the transconjugants 
was verified by restriction mapping, and a strain containing a 
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correct plasmid was kept, as SJ3282 (DN1885/pSJ3282 (-pMOL553- 
oriT; Fig. 2) ) . 

EXAMPLE 6 

Construction of a mob ilizable plasmid expressing the oAMfll 
5 resolvase 

Plasmids pWT and pMAP29 were used as starting material for the 
following experiments. pWT is a high copy number pAM01 deriva- 
tive with the restriction map given in fig. 3. It was transfor- 
med into B. subtil is DN1885, and a transformant kept as strain 

io SJ3008. This strain was deposited with the DSM as a patent 
deposit in accordance with the Budapest treaty, as DSM10444 on 
December 22, 1995. pMAP29 contains two directly repeated 
resolution sites ( res ) flanking a spectinomycin resistance 
determinant, and allows insertion of the res/spc/res structure 

15 in the amy locus of B. subtilis . It has the map shown in fig. 
4. It is kept in an E. coli strain as SJ3007. This strain was 
deposited with the DSM as a patent deposit in accordance with 
the Budapest treaty, as DSM10446 on December 22, 1995. 

pMAP29 was digested with Seal, and transformed into competent 
20 DN1885 selecting spectinomycin resistance (60 Mg/ml) . One such 
transformant was kept as SJ3109. SJ3109 was rendered competent, 
and maintained its spectinomycin resistance as competent cells. 
pWT, prepared from SJ3008, was transformed into SJ3109 selec- 
ting erythromycin resistance (5 jxg/ml) at 30 °C. Of 24 trans- 
25 formant colonies tested, two were spectinomycin sensitive. 
5 spectinomycin resistant transf ormants were propagated in TY 
medium with erythromycin (2 vg/ml) for 2 days at 3 0 °C. Replica 
plating from these cultures revealed only spectinomycin sensi- 
tive cells in three, but still spectinomycin resistant cells in 
30 the other two. Plasmid pWT was easily cured from the cells by 
restreaking two times at 50 °C. 

Thus, resolvase as expressed from pWT easily excises the spec- 
tinomycin resistance gene from the chromosome of SJ3109. 
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The sequence of the pAM£l resolvase gene is known, as the re- 
solvase is encoded by ORF H as published by Swinfield et al. 
(Swinfield, T.-J., Janniere, L. , Ehrlich, S. D. , Minton, N. P. 
(1991) . Characterization of a region of the Enterococ cus faeca- 
5 lis p lasmid pAM01 which enhances the segregational stability of 
pAM01-derived cloning vectors in Bacillus subtilis . Plasmid 26 , 
209-221) . 

The gene was amplified by a PCR reaction, using as template pWT 
and the following primers: 

10 BamHI PstI < pos. 5102-5129 > 

LWN7 839: 5 ' -GACGGGATCCCTGCAGTATCCAATTTATTTTTTTCTTAACAAGG- 3 1 

EcoRI HindllK pos. 5820-5797 > 
LWN7840 : 5 1 -GACGGAATTCAAAGCTTAAAGCACTTGCATAGGCTAATGCC-3 1 

15 where the sequence numbering is taken from the above reference. 

The amplified DNA fragment was digested with restriction en- 
zymes PstI and Hindlll, and subsequently ligated to the 4.1 kb 
Pstl-Hindlll fragment isolated from pDN1981 (Jorgensen, P. L. , 
Hansen, C. K. , Poulsen, G. B., Diderichsen, B. (1990). In vivo 

20 genetic engineering: homologous recombination as a tool for 
plasmid construction. Gene 96, 37-41) and transformed into B^ 
subtilis DN1885 selecting kanamycin resistance (10 tig/ml) . Two 
transformants were kept, SJ3157 containing pSJ3157 (fig. 5) , 
and SJ3158 containing pSJ3158 (the PstI site in pSJ3158 was 

25 destroyed in the cloning, otherwise the plasmids are identi- 
cal) . pSJ3157 and pSJ3158 are examples of the plasmid pAmyLres, 
containing the resolvase gene expressed from the B. lichenifor- 
mis amvL promoter, on a pUBHO derived vector conferring 
kanamycin resistance. 

30 These two plasmids and pDN1981 as a control were transformed 
into SJ3109. Kanamycin and spectinomycin resistant colonies (5 
with each plasmid) were incubated in TY with kanamycin for 3 
days at 30 °C, then spread for replica plating: All resulting 
colonies containing pSJ3157 or pSJ3158 were spectinomycin sen- 
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sitive, whereas colonies containing pDN1981 were spectinomycin 
resistant. 

Resolvase expressed from the Termamyl* (ajnyL) promoter thus 
works efficiently in promoting recombination between res sites 
5 in the Bacillus chromosome. 

In a subsequent step, the amyL promoter and resolvase gene was 
transferred onto a mobilizable, temperature sensitive cloning 
vector, pSJ2739 (fig. 6). This vector contains the replication 
functions and erythromycin resistance gene of pE194 

10 (Horinouchi, S. , and Weisblum, B. (1982). Nucleotide sequence 
and functional map of pE194, a plasmid that specifies inducible 
resistance to macrolide, lincosamide, and streptogramin type B 
antibiotics. J. Bacteriol., 150, 804-814), oriT of pUBHO, part 
of the kanamycin resistance gene of pUBHO, and a segment of 

15 the alpha-amylase gene ( amvL ) from Bacillus licheni f ormis. Its 
full sequence is given as SEQUENCE ID no. 2. 

Genes expressed from the amyL promoter on e.g. pDN1981 can con- 
veniently be transferred onto pSJ2739 using Bglll and e.g. 
Hindlll. This cloning restores the kanamycin resistance gene. 

20 pSJ3157 and pSJ3158, respectively, were digested with Hindlll 
and Bglll, the 2.3 kb fragments ligated to the 5.4 kb Bglll- 
Hindlll fragment of pSJ2739, and the mixture transformed into 
competent DN1885 selecting kanamycin (10 ng/ml) and erythromy- 
cin (5 jxg/ml) resistance at 3 0 °C. Three trans formants kept were 

25 SJ3279 (pSJ3279; from pSJ3157) 

SJ3280 (pSJ3280; from pSJ3157) and 

SJ3281 (pSJ3281; from pSJ3158) . 

Their restriction map is shown in fig. 7. 

These plasmids are examples of the plasmid pAmyLres (ts) . 

30 Functionality of these plasmids were assayed by transformation 
of strain SJ3109; Propagation of SJ3109 containing each plasmid 
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resulted in the appearance of spectinomycin-sensitive cells, 
and the plasmids were subsequently easily cured from the cells 
by growth without antibiotics at 4 6 °C. 

EXAMPLE 7 

5 Construction of a donor strain for coniuaative transfer of the 
resolvgse expressing Plaspjfl 

The plasmids constructed in EXAMPLE 6 were transformed into 
competent cells of the conjugation donor strain PP289-5 (dal-, 
pLS20, pBC16) to allow their easy transfer into other Bacillus 
io strains by conjugation. The following strains were kept: 
SJ3308 = PP289-5 / pSJ3279 
SJ3309 o PP289-5 / pSJ3280 
SJ3310 = PP289-5 / pSJ3281 

These strains were able to mobilize the resolvase-expressing 
is plasmid into strain SJ3109, using the procedure for conjugation 
previously described, where it functioned to excise the 
spectinomycin resistance gene from the chromosome. 

EXAMPLE 8 

Insertion of resolvase target sites, res, around the cat marker 
20 on the m obilizable transposon delivery vector for transposition 
of a Sayjnase* gene 

The Mlul and SacII sites in pMOL553 (see figure 1) are unique. 
A modified version of the MluI-SacII fragment can therefore be 
constructed so as to contain res sites on each end of the cat 

25 gene, and can subsequently be ligated to MluI+SacII digested 
pM0L553. The construction can be performed by PCR amplifica- 
tion, using a number of oligonucleotide primers, by the SOE 
method as outlined in figure 8. The res site has been located 
in the pAM01 sequence (Swinfield, T. J., Oultram, J. D. , 

30 Thompson, D. E., Brehm, J. K. , Minton, N. P. (1990) Physical 
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characterization of the replication region of the Streptococcus 
faecalis plasmid pAM01, Gene 87, pp. 79-90) to pos. 4841-4951 
(Janniere, L. , Gruss, A., Ehrlich, S. D. (1993), Plasmids, pp. 
625-644 in Sonenshein, A. L. , Hoch, J. A., Losick, R. (Eds.) 
s Bacillus subtilis and other gram-positive bacteria: Biochemi- 
stry, Physiology, and Molecular Genetics. American Society for 
Microbiology, Washington, D.C.). Accordingly, the PCR primers 
used for the construction as outlined in figure 8 have the se- 
quences given below: 

10 LWN7794: 

Kpnl Mlul 

5 1 -GACGGGTACCACGCGTTAATCAATAAAAAAACGCTGTGCGGTTAAA- 

BamHK- X17092 pos. 4840-4863 — > 
GGGCACAGCGTTTTTTTGTGTATGGATCCTTCTATCTTTTATAGGTCATTAG-3 1 



15 LWN7790: 

pMOL553 pos. 3942-3920 >< X17092 pos. 4975-4956 
5 1 -TATATATTTTAAAAATATCCCACGGTTCTTCAAATATTTCTCC-3 1 



LWN7789: 

X17092 POS. 4956-4975 >< pM0L553 pos. 3920-3943 
20 5 • -GGAGAAATATTTGAAGAACCGTGGGATATTTTTAAAATATATAT-3 1 



LWN7788: 

X17092 pos. 4829-4808 >< pMOL553 pos . 4795-4773 
5 1 -CAAGTGTTCGCTTCGCTCTCACGGAGCTGTAATATAAAAACCTTC-3 ■ 



LWN7787: 

25 pMOL553 pos. 4773-4795 >< X17092 pos . 4808-4 829 
5 1 -GAAGGTTTTTATATTACAGCTCCGTGAGAGCGAAGCGAACACTTG-3 ' 
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LWN7784: 

PMOL553 pos. 4819-4796 >< X17092 pos . 4953-4933 
5 1 -CATATGATCAAATGGTTCGGATCTGATTTTCCTCCTCTAATATGC-3 1 



LWN8197: 

5 X17092 pos. 4933-4953 >< pMOL553 pos . 4796-4819 

5 1 -GCATATTAGAGGAGGAAAATCAGATCCGAACCATTTGATCATATGACAAGATGTG-3 1 



LWN7791: 

EcoRI SacII 
5 • -GACGGAATTCCCGCGGTAAATAGCAATAAATTGGC- 3 1 



10 LWN7780: 

Kpnl Mlul 
5 1 -GACGGGTACCACGCGTTAATC-3 ' 



PCR fragments A, B, C, and D (figure 8) were prepared as fol- 
lows , using 10 pmole of each primer in a standard PCR reaction 
15 (2 5 cycles) with Taq polymerase. 

A: Primers LWN7794 and LWN7790, template pWT (from SJ3008) , an- 
nealing temperature 49 °C The resulting fragment of 230 bp was 
purified from an agarose gel. 

B: Primers LWN7789 and LWN7788, template pMOL553, annealing 
20 temperature 51 °C. The resulting fragment of 900 bp was purified 
from an agarose gel. 

C: Primers LWN7787 and LWN7784, template pWT (from SJ3008) , 
annealing temperature 53 °C. The resulting fragment of 180 bp 
was purified from an agarose gel. 
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D: Primers LWN8197 and LWN7791, template pMOL553, annealing 
temperature 55 °C. The resulting fragment of 275 bp was purified 
from an agarose gel. 

These fragments were assembled in two steps. In a first step, 
5 aliquots of purified A and B fragments were mixed and used as 
template in a PCR reaction with primers LWN7780 and LWN7788, 
with an annealing temperature of 55 °C. The resulting combined 
fragment (AB) of 1100 bp was purified from an agarose gel. 
In parallel, aliquots of purified fragments C and D were mixed 
10 and used as template in a PCR reaction with primers LWN7787 and 
LWN7791, with an annealing temperature of 55 °C. The resulting 
combined fragment (CD) of 440 bp was purified from an agarose 
gel* 

In a second step, the two purified combined fragments, AB and 
15 CD, were mixed and used as template in a PCR reaction with 
primers LWN7780 and LWN7791, with an annealing temperature of 
55 °C. The resulting combined fragment (ABCD) of 1500 bp was 
purified from an agarose gel. 

The purified fragment (ABCD) was digested with EcoRI and Kpnl, 
20 ligated to EcoRI + Kpnl digested pUC19, and the ligation mix- 
ture transformed into E. coli SJ2, selecting resistance to 
ampicillin (100 ^g/ml) and chloramphenicol (6 nq/ml) . 5 trans- 
formants were kept, as 
SJ3372 = SJ2 / pSJ3372 
25 SJ3373 = SJ2 / pSJ3373 
SJ3374 = SJ2 / pSJ3374 
SJ3375 = SJ2 / pSJ3375 
SJ3376 = SJ2 / pSJ3376. 

The structure of these plasmids was verified by restriction 
30 analysis, and by sequencing of the res and IR parts of the 
insert using primers LWN8906: 5 ■ -GGTTTTTCGCATGTATTGCG-3 ■ and 
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LWN8907: 5 * -GTTCATTTGATATGCCTCC-3 1 , on an Applied Biosystems 
automatic sequencer. 

A restriction map of pSJ3372-76 is given in figure 9. DNA 
sequencing revealed, that the res and IR regions on plasmids 
5 pSJ3372 and pSJ3376 contained no mutations due to errors in the 
PCR amplification. 

The final transposon donor plasmid was constructed by ligating 
the 1.5 kb SacII-MluI fragment from either plasmid pSJ3372 or 
plasmid pSJ3 376 to the 9.6 kb SacII-MluI fragment from pSJ3282. 

10 The ligation mixtures were transformed into E» coli SJ2 by 
electroporation, and cells plated on either ampicillin (100 
Mg/ml) plus chloramphenicol (6 nq/vil) , or erythromycin (5 
Mg/ml) plus chloramphenicol (6 ng/ral) , at 30 °C, and transfor- 
mants restreaked on all three antibiotics . Two transf ormants 

15 were kept: 

SJ3389 = SJ2 / pSJ3389 (from pSJ3376) and 
SJ3390 = SJ2 / pSJ3390 (from pSJ3372) . 

The plasmid structures were verified by PCR amplifications and 
restriction analysis. The restriction map of pSJ3389-90 is 
20 given in figure 10. 

EXAMPLE 9 

Construction of a donor strain for coniuaative transfer of the 
transposon delivery vector of example 8 

Competent cells of B. subtilis strain PP289-5 were transformed 
25 with pSJ3389, selecting resistance to tetracycline (5 nq/ml) , 

erythromycin (5 tig/ml) , and chloramphenicol (6 iig/ml) on LBPSG 

plates with D-alanine (100 Mg/ml) . 

Transf ormants kept are 

SJ3503 = PP289-5 / pSJ3389 and 
30 SJ3504 « PP289-5 / pSJ3389 
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EXAMPLE 10 

Introduction of the transooson delivery vector of example 8 
into B t sufrtil is , and iso lation of strains in which trans- 
position has occured 

5 B. subtilis DN1885 competent cells were transformed with plas- 
mids pSJ3389 or pSJ3390, selecting erythromycin (5 Mg/ml) and 
chloramphenicol (6 Mg/ml) resistance at 30 °C. Four transfor- 
mants containing each plasmid were kept, all exhibiting a dis- 
tinct protease-positive phenotype on plates with skimmed milk: 



10 SJ34 31 




DN1885/pSJ3389 


SJ3432 




DN1885/pSJ3389 


SJ3433 




DN1885/pSJ3389 


SJ3434 




DN1885/pSJ3389 


SJ3435 




DN1885/pSJ3390 


15 SJ3436 




DN1885/pSJ3390 


SJ3437 




DN1885/pSJ3390 


SJ3438 




DN1885/pSJ3390 



These strains were inoculated in 10 ml TY medium with 3 Mg/^l 
chloramphenicol, and shaken at 30 °C overnight. 100 pi was tra- 

20 nsferred to new 10 ml TY medium tubes with 3 ng/ml chloramphe- 
nicol, and shaken at 50 °C for 4 hours. A dilution series from 
each tube was then plated on LBPSG with 6 pg/vxl chloramphenicol 
and incubated at 50 °C for two days. These plates were then 
replica plated onto 

25 i) plates with erythromycin (5 jig/ml) and chloramphenicol (6 
jig/ml) , and ii) plates with chloramphenicol (6 jug/ml) only, and 
the plates incubated at 37 °C overnight. 

Among the chloramphenicol resistant colonies, only 10-20 % had 
retained the erythromycin resistance, indicating the continued 
30 presence of the transposon donor plasmid in these cells. One 
chloramphenicol resistant, erythromycin sensitive isolate from 
each of the 8 strains was kept: 

SJ3465 from SJ3431, SJ3466 from SJ3432, SJ3467 from SJ3433, 
SJ3468 from SJ3434, SJ3469 from SJ3435, SJ3470 from SJ3436, 
35 SJ3471 from SJ3437, and SJ3472 from SJ3438. 
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SJ3470 was distinctly red in liquid TY medium. 

A control PCR reaction, using primers LWN5136: 5 1 -CCGGCGGATC- 
CAAGGGGTGATCG-3* andLWN5067: 5 * -CCAGAACCTGTCAATCCACG-3 x , with 
annealing temperature 55 °C, was performed on these strains. A 
5 fragment of 900 basepairs, as expected for amplification from 
the Savinase* construct on the transposable DNA, was obtained, 
confirming the presence of the Savinase* gene in the genome of 
these strains. 



EXAMPLE 11 

io Resolvase mediated marker deletion from the strains of example 

Strain SJ3309, described in EXAMPLE 7, is able to donate a 
resolvase-expressing plasmid to Bacillus recipient strains by 
conjugation. 

15 SJ3309 was streaked on LBPSG plates with D-alanine (100 Mg/ml) i 
tetracycline (5 iiq/ml) and erythromycin (5 /xg/ml) , and incuba- 
ted at 30 °C overnight. 

Each of the strains SJ3465 to SJ3472 were streaked on LBPSG 
plates with chloramphenicol (6 Mg/ml) and incubated at 37 °C 
20 overnight. 

On separate LBPSG plates containing D-alanine (100 Mg/ml) , each 
of the strains SJ3465 to SJ3472 were mixed with strain SJ3309, 
and the plates incubated at 3 0 °C for 5 hours. Cells were then 
resuspended in TY medium and plated on LBPSG plates with eryth- 
25 romycin (5 Mg/ml) and chloramphenicol (6 Mg/ml)/ or erythromy- 
cin only. These plates were incubated at 30 °C for three days. 

Transconjugants where the recipient strains were SJ34 65 to 
SJ3468, showed good growth on erythromycin only, but poor 
growth on both chloramphenicol and erythromycin. 
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Transconjugants where the recipient strains were SJ34 69 to 
SJ3472, showed good growth on both types of plates. 
Upon reisolation, transconjugants derived from SJ3465 to SJ3468 
could only grow on erythromycin, indicating loss of the chlora- 
5 mphenicol resistance gene* Transconjugants derived from SJ3469 
to SJ3472 could grow on both types of plates. 

To remove the resolvase expressing plasmid from the cells, 
transconjugants from SJ3465 to SJ3468 were inoculated in TY 
medium and incubated at 50 °C with shaking for 5 hours. Aliquots 

10 from these cultures were streaked on LBPSG plates, which were 
incubated at 50 °C overnight, and subsequently replicated to 
LBPSG, LBPSG with chloramphenicol (10 ^g/ml) , and LBPSG with 
erythromycin (5 jig/ml) . These plates were incubated at 37 °C 
overnight. One erythromycin sensitive, chloramphenicol sensiti- 

15 ve isolate from each culture was kept: 
SJ3461 derived from SJ3465 
SJ3462 derived from SJ3466 
SJ3463 derived from SJ3467 
SJ3464 derived from SJ3468. 

20 Transconjugants from SJ3469 to SJ3472, having retained their 
chloramphenicol resistance, were inoculated in TY medium with 
2 nq/ml erythromycin, and shaken at 3 0 °C for 3 days. Aliquots 
were plated on LBPSG at 3 0 °C, and plates replicated to LBPSG, 
LBPSG with chloramphenicol (10 nq/ml) , and LBPSG with eryth- 

25 romycin (5 nq/ml) . All colonies were erythromycin resistant, 
but 90 % of colonies from SJ3470, and all from the three ot- 
hers, were chloramphenicol sensitive, again indicating loss of 
the chloramphenicol resistance gene. 

The resolvase expressing plasmid was removed from the strains 
30 as above. The following strains were kept: 
SJ3489 derived from SJ3469 
SJ3490 derived from SJ3470 
SJ3491 derived from SJ3472. 
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The presence of the Savinase* gene was verified by PCR using 
primers LWN5136 and LWN5067 as described in example 10. 



EXAMPLE 12 

Reintroduction of the transposon delivery vector of EXAMPLE 8 
5 into the marker free strains of EXAMPLE 11. and isolation of 
strains in which transposition has occured 

pSJ3389 was transferred from SJ3503 and SJ3504 into strain 
SJ3461 by conjugation on plates, as previously described. After 
incubation of the mixed strains on D-alanine plates, cells were 
10 resuspended and plated on LBPSG containing chloramphenicol (6 
Mg/ml) and erythromycin (5 jig/ml) at 30 °C. Colonies were chec- 
ked for tetracycline resistance, indicating presence of pBC16. 
3 among 30 checked were tetracycline sensitive. These were kept 
as 

15 SJ3486 = SJ3461 / pSJ3389, SJ3487 = SJ3461 / pSJ3389, and 
SJ3488 - SJ3461 / pSJ3389. 

Strains, in which a new copy of the Savinase* gene had been 
inserted into the genome by transposition, were isolated 
essentially as previously described, except that BPX was used 
20 as the liquid medium. 

Strains SJ3486, 3487 and 3488 ( B. subtil is DN1885 containing 
one chromosomal Savinase* gene copy, plus the transposon donor 
plasmid) , and as a control strain SJ3431 ( B, subtilis DN1885 
containing only the transposon donor plasmid) were inoculated 

25 in 10 ml BPX with chloramphenicol (6 fig/ml) , and incubated with 
shaking for 4 days at 30 °C. Aliquots (appr. 100 /il) were 
transferred to new 10 ml BPX tubes with chloramphenicol (6 
^g/ml) , and incubated with shaking for 4 hours at 50 °C. 
Aliquots from these cultures were then spread on LBPSG contai- 

30 ning chloramphenicol (6 pg/ml) at 37 °c, and the resulting 
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colonies checked for erythromycin resistance by replica plating. 
All colonies derived from strain SJ3431 were erythromycin sen- 
sitive. 

All colonies derived from strains SJ3486 and SJ3488 were eryth- 
5 romycin resistant. 
About half the colonies from strain SJ3487 were erythromycin 
sensitive. 10 of these were kept as SJ3537 to SJ3546. 

EXAMPLE 13 

Resolvase mediated marker deletion from the strains of example 
10 12 

Strains SJ3537, 3539, 3541 and 3546 were chosen for further 
work. The resolvase expressing plasmid was introduced into 
these strains by conjugation from strain SJ3309, as described 
in example 11 for the one-copy strains. 

15 All transconjugant colonies, selected on erythromycin plates, 
were by replica plating found to be chloramphenicol sensitive, 
except for a few colonies from the SJ3541 recipient. The ex- 
cision of the chloramphenicol resistance gene, mediated by 
resolvase, was thus found to be very efficient. About 50 % of 

20 the colonies were tetracycline sensitive, indicating absence of 
pBC16 (which is present as a helper in the donor strain) . 
One tetracycline sensitive, chloramphenicol sensitive trans- 
conjugant colony from each recipient was propagated in TY at 50 
°C for 4 hours, then plated on LBPSG at 50 °C, and the resulting 

25 colonies checked for absence of the resolvase-expressing 
plasmid by replica plating to LBPSG plates with erythromycin (5 
Mg/ml) . All were erythromycin sensitive. One coloni, derived 
from each recipient strain, was kept: 

SJ3565 from SJ3537, SJ3566 from SJ3539, SJ3567 from SJ3541, and 
30 SJ3568 from SJ3546. 
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EXAMPLE 14 

Visualisation of Savinase* yields from the strains of examples 
10=11 

Halo formation on plates; 

5 Strains DN1885 (the B. subtilis Savinase* negative recipient 
strain), SJ3465-SJ3472 (1. transposition, Cam R , Sav*) , SJ3461 
(1. transposition Cam s , Sav* (from SJ3465)), SJ3537-SJ3546 (2. 
transposition (in SJ3461) , Cam R , Sav*) , and SJ3565-SJ3568 (2. 
transposition, Cam s , Sav*) were streaked on LBPSG plates 

io containing 1 % skimmed milk. After overnight incubation at 37 
°C, differenses in halo formation were observed. DN1885 had 
hardly any halo, 1. transposition strains had a distinct halo, 
and 2. transposition strains seemed to have a slightly larger 
halo than 1. transposition strains. No difference in halo size 

15 was observed due to the removal of the cat gene. 

ttjmcjni immunodiffusion assay; 

Strains were inoculated in 100 ml BPX shake flasks (with addi- 
tion of 0.5 ml 1M NaOH) , and shaken for 4 days at 30 °C, 300 
rpm. 

20 Supernatants were subjected to a Mancini immunodiffusion assay 
in an agarose gel containing anti-Savinase antibody. 
No precipitation zone was seen with DN1885, in accordance with 
the absence of the Savinase* gene from this strain. Zones of 
somewhat varying size were seen for the 1. transposition 

25 strains (SJ3465-3472) . Removal of the cat gene did not signifi- 
cantly affect this zone (SJ3461) . Zones of conspicuously larger 
size were seen from the 2. transposition strains (SJ3537-3546) , 
and removal of the cat gene did not significantly affect these 
zones (SJ3565-3568) . The immunodiffusion plates are reproduced 

30 in figure 11, A and B. 
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CONCLUSION: 

These phenotypical tests thus revealed the results of the 
strain constructions: One copy (at least) of the Savinase* gene 
could be inserted by transposition, giving rise to Savinase* 

5 production. The chloramphenicol resistance gene on the trans- 
posed DNA could be removed using resolvase, and the resulting 
strain could still produce Savinase* at the original level. One 
(or more) further copies of the Savinase* gene could be 
inserted into the one copy strain, reusing the same transposon 

io donor plasmid as used for insertion of the first copy. This 
resulted in an increased Savinase* yield. The chloramphenicol 
resistance gene on the transposed DNA could subsequently be 
removed using resolvase again, and the resulting strains 
retained their ability to produce increased Savinase* yields. 

15 EXAMPLE 15 

Southern analysis of stra ins of examples 10-13 

The strains of examples 10-13 were grown overnight in TY 
medium, and chromosomal DNA extracted by standard procedures 
(phenol/chloroform extractions) . The DNA was digested with 

20 EcoRI, which cuts within the Savinase* gene part of the 
transposable DNA, and in addition with enzymes Bgll, PstI and 
SacII, which does not cut within the transposable DNA. Several 
enzymes were used together in order to obtain DNA fragments of 
moderate size, that could be well resolved on the gel. Frag- 

25 ments were transferred to Immobilon-N (Millipore) membranes 
after electrophoresis by vacuum blotting, and the membrane 
probed with biotinylated labeled probes, using the NEBlot 
Phototope Kit and Phototope Detection Kit from New England 
Biolabs. The same membrane was used with several sets of 

30 probes. First, a probe recognizing the upstream half of the 
transposable DNA was used, i.e. the region between the IR at 
around pos. 2 000 and the EcoRI site at around pos. 3 600 in 
PSJ3389. The probe was made by PCR amplification from pMOL553, 
using as primers LWN5136 and LWN5067. This probe is expected to 
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recognize one fragment derived from each transposon insertion, 
and this fragment should not be affected by the subsequent 
action of the resolvase. This is exactly what is observed from 
figure 12A. Strains SJ3465 to SJ3472 were isolated following 

5 the first transposition round, and contain both the sat and the 
savinase gene. As they resulted from different transposition 
events, different fragment sizes are observed. Strain SJ3461 
was isolated from SJ3465, and contain only the savinase gene. 
The same hybridizing fragment is observed in this strain (2.0 

10 kb) . Strains SJ3537-3546 were isolated following the second 
transposition round. They contain the same fragment as SJ3461 
(2.0 kb) , but has acquired an additional hybridizing fragment 
of 1.8 kb. The same two hybridizing fragments are observed in 
strains SJ3565-3568, in which the resolvase gene was used to 

« delete the cat gene (DNA from SJ3565 was degraded by nuclea- 
ses) . 

The membrane was subsequently stripped for the probe, and 
rehybridized with two probes. One was the same as before. The 
other was the PCR fragment containing ggs-cat-res-IR, called 

20 ABCD in example 8. The first probe should recognize the same 
fragments as before. The new probe should recognize a new 
fragment, and the size of this fragment should become reduced 
by the resolvase mediated deletion of the cat gene. This is 
exactly what is observed from figure 12B. The new fragment, of 

25 about 3.7 kb in strain SJ3465, is reduced to about 2.7 kb in 
strain SJ3461. Strains SJ3537-46 contain an additional fragment 
of 3.3 kb, which is reduced to 2.3 kb in strains SJ3 566-68. 



Fragments were visible at higher positions on the southern blot 
- these were due to incomplete digestion of the chromosomal DNA 
30 with one or more of the enzymes. 

Also, the fragments containing the res -cat-res -IR part of the 
transposon are actually visible on figure 12A. This is caused 
by the presence of some plasmid pMOL553 in the material used 
for labelling (The PCR fragment was not gel purified) . 
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EXAMPLE 16 

Construction — of a mobilizable transooson delivery vector 

containing a res -spc-res cassette 

A) Amplification of the spc resistance gene. 

5 The spectinomycin resistance gene was obtained from plasmid 
PMAP29 (figure 4) on a 1.2 kb Sall-BaroHI fragment. This 
fragment was purified from an agarose gel, and ligated to Xhol 
+ BamHI digested pDN3000 (Diderichsen et al., 1990, J. Bac- 
terid. 172, 4315-4321), to give plasmid pSJ3216 (figure 13). 
10 Ligation mixtures were introduced into E. coli SJ2 by elec- 
troporation, and selection was for ampicillin (100 ^g/ml) and 
spectinomycin (60 ng/ml) resistance. 

The spc gene was PCR amplified from pSJ3216 using primers 
LWN8524 : 5 1 -GACTGAATTCGGATCCACGCGTATAATAAAGAATAATTATTAATCTGTAG- 

15 3 1 , and LWN8528: 5 1 -GACTAAGCTTGAGCTCCACTAATATTAATAAACTATCGAAGG- 
3'; Annealing temperature was 50 °C. The fragment was purified 
on an agarose gel, digested with EcoRI and Hindlll, and ligated 
to EcoRI + Hindlll digested pUC19. E. coli SJ2 was transformed 
by electroporation, selecting ampicillin resistance (100 Mg/ml) 

20 on plates with IPTG and X-gal. White colonies were reisolated 
on plates with ampicillin (100 Mg/ml) and spectinomycin (60 
Mg/ml) . Strains kept are SJ3318 (SJ2/pSJ3318) and SJ3319 
(SJ2/pSJ3319) . The map of pSJ3318 is given in figure 14. 

B) Amplification of the res site with multilinker. 



25 Plasmid pWT (figure 3) was used as template in a PCR reaction 
with primers LWN8529: 5 1 -GACTGAATTCCTGCAGGAGCTCAGTGAGAGCGAAGC- 
GAACAC-3 • and LWN8531 : 5 1 -GACTAAGCTTTGATCAAATGGTTGCGGCCGCGT- 
CGACTCTAGACCCGGGTACCAGATCTGGATCCTCGGGTC ■ ; an- 

nealing temperature was 59 °C. The fragment was purified from 

30 an agarose gel, digested with EcoRI and Hindlll, and ligated to 
EcoRI + Hindlll digested pUC19. E, coli SJ2 was transformed by 
electroporation, selecting ampicillin resistance (100 Mg/ml) on 
plates with IPTG and X-gal. White colonies were reisolated on 
similar plates. Plasmids prepared from selected transf ormants 
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were DNA sequenced using primer LWN7191: 5 1 -GTTTTCCCAGTCACGAC . 
Two plasmids with correct sequences of the insert were kept: 
SJ3328 (SJ2/pSJ3328) and SJ3329 (SJ2/pSJ3329) (figure 15). 

C) Amplification of the res site. 

5 Plasmid pWT was used as template in a PCR reaction with primers 
LWN8518 : 5 1 -GACTAAGCTTACGCGTTCGGGTTCTTCAAATATTTCTCC~3 » and 
LWN8 527: 5 • -GACTGAATTCTGATCAAATGGTTCAGTGAG AGCGAAGCGAACAC- 3 1 ; 
annealing temperature was 59 °C. The fragment was purified from 
an agarose gel, digested with EcoRI and Hindlll, and ligated to 

io EcoRI + Hindlll digested pUC19. E, coli SJ2 was transformed by 
electroporation, selecting ampicillin resistance (100 /xg/ml) on 
plates with IPTG and X-gal. White colonies were reisolated on 
similar plates. Plasmids prepared from selected transformants 
were DNA sequenced using primer LWN7191: 5 1 -GTTTTCCCAGTCACGAC . 

15 Two plasmids with correct sequences of the insert were kept: 
SJ3326 (SJ2/pSJ3326) andSJ3327 (SJ2/pSJ3327 ) (figure 16). 



D) Construction of a spc - res fragment. 

The l.l kb SacI-EcoRI fragment from pSJ3318 was isolated from 
an agarose gel, and ligated to SacI + EcoRI digested pSJ3328. 
20 The ligation mixture was introduced into E. co^j. SJ6 by elec- 
troporation, and transformants selected on plates with ampi- 
cillin (100 Mg/ral) and spectinomycin (60 iig/ml) . Two transfor- 
mants were kept, SJ3341 (SJ6/pSJ3341) and SJ3342 (SJ6/pSJ3342) 
(figure 17) . 

25 E) Construction of a res - spc - res fragment. 

The 0.2 kb EcoRI -Mlul fragment from pSJ3326 was purified from 
an agarose gel, and ligated to EcoRI + Mlul digested pSJ3341. 
The ligation mixture was introduced into E. coli SJ6 by elec- 
troporation, and transformants selected on plates with ampicil- 

30 lin (100 fig/ml) and spectinomycin (60 Mg/ml) . Two transformants 
were kept, SJ3358 (SJ6/pSJ3358) and SJ3359 (SJ6/pSJ3359) 
(figure 18) . 
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F) Cloning of minitransposon (IR-cat-IR) in pUC. 

The minitransposon segment (without the transposase gene) was 
excised from pHV1248 on a 1.2 kb Hindlll fragment, which was 
ligated to Hindlll digested pUC19 and used to transform E. coli 
5 SJ6 by electroporation to ampicillin (100 /ig/ml) and chloramp- 
henicol (10 jig/ml) resistance. Two transformants were kept, 
SJ3354 (SJ6/pSJ3354) andSJ3355 (SJ6/pSJ3355) (figure 19). 

G) Insertion of the res - spc - res segment into minitransposon. 
The minitransposon contained BsaBI sites in the IR sequences 

10 outside the terminal regions essential for transposition. These 
sites were used for insertion of the res - spc - res segment. 

The res - spc - res segment was excised from pSJ3358 on a 1.6 kb 
HincII-PvuII fragment, purified from an agarose gel, and liga- 
ted to the 2.8 kb BsaBI fragment of pSJ3354. The ligation 
15 mixture was used to transform, by electroporation, E. coli 
strain SJ6 to ampicillin (100 jig/ml) and spectinomycin (60 
/ig/ml) resistance. Two transformants were kept, SJ3444 
(SJ6/pSJ3444) and SJ3445 (SJ6/pSJ3445) (figure 20). 

H) Construction of a res - spc - res containing transposon donor 
20 plasmid. 

The transposable cassette was excised on a 1.85 kb Hindlll 
fragment from plasmid pSJ3444, which was ligated to the 7.85 kb 
Hindlll fragment from pSJ3282. The ligation mixture was 
introduced into E. coli SJ6 by electroporation, selecting 
25 ampicillin (100 jig/ml) and spectinomycin (60 txq/ml) resistance. 
Two transformants were kept, with the transposable cassette in 
different orientations relative to the vector part: SJ3475 
(SJ6/pSJ3475; -figure 21) and SJ3476 (SJ6/pSJ3476 ; figure 22). 

The functionality of this transposon donor plasmid was tested. 
30 Plasmids pSJ3475 and pSJ3476 were transformed into competent 
subtil is DN1885 selecting for erythromycin (5 /ig/ral) and spec- 
tinomycin (60 /xg/ml) resistance. Four transformants with each 
plasmid were taken through a transposition round: They were 
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grown overnight in 10 ml TY with spectinomycin (60 Mg/ml) . 50 
/xl was transferred to a new culture of 10 ml TY with spectino- 
mycin (60 ^g/ml), incubated at 50 °C for 4 hours (resulting in 
good growth of pSJ3475 transf ormants, poor growth of pSJ3476 

5 transf ormants) , and plated on LBPSG with spectinomycin (60 
Mg/ml) at 50 °C. 10 single colonies from each plate was streaked 
on duplicate plates with erythromycin (5 ng/ml) and spec- 
tinomycin (60 ng/ml) , respectively. Most colonies were eryth- 
romycin sensitive, indicating proper transposition of the IR- 

10 res - soc - res -IR segment and loss of the donor plasmid. 2 Spc R , 
Erm s colonies derived from each of pSJ3475 and pSJ3476 trans- 
f ormants were used as recipients for conjugation with SJ3 309 as 
donor, and transconjugants selected at 30°C on plates with kana- 
mycin (10 /xg/ml) and with or without spectinomycin (60 Mg/ml) . 

is Good growth was observed on kanamycin plates, but transcon- 
jugants from three of the four strains grew poorly on plates 
with both antibiotics* Spectinomycin sensitive, kanamycin re- 
sistant colonies were in all instances obtained by reisolation 
from the kanamycin plates. 

20 Thus, the modified transposon functions in B. subtil is . and the 
spectinomycin resistance gene can be subsequently deleted using 
resolvase. 



EXAMPLE 17 

Construction of a mobilizable transposon delivery vector 
25 containing a res-kan-res cassette 

A) Amplification of the kan resistance gene. 

The kan gene was PCR amplified from pUBHO using primers 
LWN8516 : 5 1 -GACTGAATTCGGATCCACGCGTGAGTAGTTCAACAAACGGGCC-3 1 and 
LWN8 517: 5 1 -GACTAAGCTTGAGCTCCAACATGATTAACAATTATTAGAGG- 3 1 ; an- 
30 nealing temperature was 59 °C. The fragment was purified from 
an agarose gel, digested with EcoRI and Hindlll, and ligated to 
EcoRI + Hindlll digested pUC19. E. coli SJ2 was transformed by 
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electroporation, selecting ampicillin resistance (100 ng/ml) on 
plates with IPTG and X-gal. White colonies were reisolated on 
plates with ampicillin (100 ng/ml) and kanamycin (20 pg/ml) . 
Strains kept are SJ3316 (SJ2/pSJ3316) and SJ3317 (SJ2/pSJ3317) 
5 (figure 23) . 

B) Construction of a kan - res fragment. 

The l.o kb SacI-EcoRI fragment from pSJ3316 was isolated from 
an agarose gel, and ligated to Sad + EcoRI digested pSJ3 328. 
The ligation mixture was introduced into E. coli SJ6 by elec- 
io troporation, and transf ormants selected on plates with ampi- 
cillin (100 fig/ml) and kanamycin (10 ^g/ml) . Two transf ormants 
were kept, SJ3339 (SJ6/pSJ3339) and SJ3340 (SJ6/pSJ3340) 
(figure 24) . 

C) Construction of a res - kan - res fragment. 

15 The 0.2 kb EcoRI -Mlul fragment from pSJ3326 was purified from 
an agarose gel, and ligated to EcoRI + Mlul digested pSJ3340. 
The ligation mixture was introduced into E. coli SJ6 by elec- 
troporation, and transformants selected on plates with ampicil- 
lin (100 fig/ml) and kanamycin (10 fig/ml). Two transformants 

20 were kept, SJ3356 (SJ6/pSJ3356) and SJ3357 (SJ6/pSJ3357) 
(figure 25) . 



D) Construction of a res - kan - res containing transposon donor 
plasmid. 

i) Cloning of a transposon IR sequence. 

25 The "downstream" transposon IR sequence was PCR amplified from 
PMOL553 as template, using primers LWN8760: 5 1 -GACGGAATTCTC- 
TAGAGTCGACAGATCCGAACCATTTGATCATATGACAAGATGTG-3 ' and LWN8761 : 
5 • -GACGGAATTCGCGGCCGCGGTAAATAGCAATAAATTGGCC-3 1 . The purified 
fragment was digested with EcoRI, ligated to EcoRI digested 

30 pUC19, and introduced by electroporation into E. coli SJ2 , 
selecting ampicillin resistance (100 fig/ml) on plates with IPTG 
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and X-gal. About 20 white colonies among 500 was obtained. 
Plasmid from three was DNA sequenced using as primer LWN4123: 
5 1 - AGCGGATAACAATTTCACACAGG A- 3 1 . Two correct clones were kept as 
SJ3385 (SJ2/pSJ3385) (figure 26) and SJ3386 (SJ2/pSJ3386) . 

5 ii) Combination of one IR with res - kan - res sequence. 
The IR sequence was excised from pSJ3385 as a 0.3 kb Notl-Xbal 
fragment, ligated to NotI + Xbal digested, phosphatase treated 
pSJ3356, and introduced into SJ6 by electroporation, selecting 
ampicillin (100 Mg/ml) and kanamycin (10 fig/ral) resistance. Two 

10 trans forroants were kept as SJ3459 (SJ6/pSJ3459) and SJ3460 (SJ- 
6/pSJ3460) (figure 27). 

iii) Construction of the final transposon donor plasmid. 

In this step, the spc - res segment from donor plasmid pSJ3476 

was replaced with the corresponding kan - res segment from 

15 pSJ3459. Thus, this 1.15 kb fragment was excised from pSJ3459 
using Mlul and BamHI, and the purified fragment ligated to the 
8.2 kb MluI-BamHI fragment from pSJ3476. The mixture was 
introduced into E. coli SJ6 by electroporation, selecting 
ampicillin (100 fxg/ml) and kanamycin (10 Mg/ml ) resistance. Two 

20 transformants were kept as SJ3586 (SJ6/pSJ3586) and SJ3587 
(SJ6/pSJ3587) (figure 28). 

EXAMPLE 18 

Use of the transposon delivery vector o f EXAMPLE 16 for 

transposition of an amylase gene into the B. subtilis genome 

25 The B. licheniformis amylase gene (amyXj) gene was excised from 
pDN1981 as a 3.2 kb Bglll-Bcll fragment, and ligated to BamHI 
digested pSJ3476. The ligation mixture was transformed into 
subtilis DN1885, selecting erythromycin resistance (5 tig/ml) at 
30 °C. Four amylase positive colonies were reisolated on 

30 erythromycin (5 iig/ml) and spectinomycin (60 Mg/ml) plates, and 
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kept as SJ3524 (DN1885/pSJ3524) , SJ3525 (DN1885/pSJ3525) , 
SJ3526 (DN1885/pSJ3526) , and SJ3527 (DN1885/pSJ3527) (figure 
29) . 

Strains SJ3524-SJ3527 were tested for transposition . Each was 
5 inoculated in LB with erythromycin (5 /ig/ml) and spectinomycin 
(60 jig/ml) , incubated overnight at 30 °C, and aliquots trans- 
ferred to LB with spectinomycin (60 ^g/ml) . These cultures were 
shaken at 50 °C for 5 hours, then plated on spectinomycin plates 
(60 /ig/ml) and incubated at 30 °C overnight. Plates from SJ3524- 
io 3526 were almost overgrown; 8 colonies from each were tested 
and found erythromycin resistant ♦ The plate from SJ3527 was 
replica plated, and several erythromycin sensitive, spec- 
tinomycin resistant and amylase postivive colonies were found. 
One was kept as SJ3549. 

15 EXAMPLE 19 

Resolvase mediated marker deletion from the strain o f fvampt/e 
18 

The resol vase-expressing plasmid was introduced into SJ3549 by 
conjugation from SJ3309 as previously described. Transconju- 

20 gants were selected on kanamycin (10 nq/ml) and spectinomycin 
(60 jig/ml), and reisolated on kanamycin (10 nq/ml) with or 
without spectinomycin (60 /ig/ml) . Growth was poor on plates 
with both antibiotics, good on kanamycin plates. 10 colonies 
from kanamycin plates were reisolated on LBPSG, and subsequent- 

25 ly replica plated. 8 spectinomycin sensitive, kanamycin re- 
sistant kolonies thus isolated were inoculated in TY and in- 
cubated at 50 °C overnight, then plated on LBPSG, and these pla- 
tes replicated after overnight incubation at 3 7 °C. All co- 
lonies, except some from culture 1, were kanamycin sensitive. 

30 Reisolation confirmed sensitivity to kanamycin (10 /ig/ml) and 
erythromycin (5 ^g/ml) . Three strains were kept: 
SJ3558 (Spc s , Erm s , Kan s , Amy*) 
SJ3559 (Spc s , Erm s , Kan s , Amy*) 
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SJ3560 (Spc s , Erm s , Kan s , Amy*) 

The three strains were tetracycline sensitive, indicating ab- 
sence of pBC16, which is present in the conjugation donor str- 
ain. 



5 EXAMPLE 20 

construction of a donor strain for coniuaativ e transfer of the 
transoos on delivery vector of example 18 

Plasmids pSJ3524 and pSJ3526 were transformed into competent 
cells of a B. subtil is donor strain (daT, pXO503 (erm R ) , pBC16 
10 (tet R )) for conjugation, plated on LBPSG with erythromycin (5 
Mg/ml) , tetracycline (5 nq/ml) , spectinomycin (60 tig/ml) and D- 
alanine (100 Mg/ml) at 30 °C, and one strain containing each 
plasmid was kept: SJ3547, containing pSJ3524, and SJ3548, 
containing pSJ3526. 



15 EXAMPLE 21 

Re introduction of the transooson deliver y vector of EXAMPLE 18 
ir>to the marker free strains of examp le 19. and isolation of 
strains in which transposition has occured 

Plasmid pSJ3524 was transferred into SJ3559, a marker free amyL 
20 expressing strain constructed in example 19, by conjugation 
from SJ3547 as previously described. One tetracycline sensitive 
transconjugant was kept as SJ3592. This strain was incubated 
with shaking in TY with spectinomycin (60 Mg/ml) at 3 0 °C 
overnight, diluted 100 times into a new TY culture with spec- 
25 tinomycin (60 pq/ml) , and incubated with shaking for 3 hours at 
30 °C. It was then diluted 50 times into a new TY culture with 
spectinomycin (60 Mg/ml) , incubated with shaking for 3 hours at 
50 °C, and subsequently plated on LBPSG with spectinomycin (60 
ng/ml) at 50 °C. Almost all of the resulting colonies were found 
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to be erythromycin sensitive by replica plating. Four were 
kept, as SJ3626-SJ3629. 



In a similar experiment, pSJ3524 was transferred into SJ3560 by 
conjugation from SJ3547, and two tetracycline sensitive 
5 transconjugants kept as SJ3593 and SJ3594. These were incubated 
with shaking in TY with spectinomycin (60 ng/ml) at 30 °c 
overnight, diluted 100 times into a new TY culture with 
spectinomycin (60 ng/ml) , incubated with shaking for 6 hours at 
50 °C, and plated on LBPSG with spectinomycin (60 /ig/ml) at 50 
10 °C. By replica plating, one spectinomycin resistant and 
erythromycin sensitive colony was found from SJ3593 (kept as 
SJ3599), and two such colonies were found from SJ3594 (kept as 
SJ3600 and SJ3601) . 



EXAMPLE 22 

15 Pesolvase mediated marker d e letion from the strains of example 
21 

Strains SJ3626-3629 were used as recipients to recieve the 
resolvase expressing plasmid by conjugation from strain SJ3309. 
Transconjugants were selected on LBPSG with kanamycin (10 

20 Mg/ml) . colonies from these plates were replicated to kanamycin 
(10 /ig/ml) and spectinomycin (60 ng/nl) , and all were found to 
be spectinomycin sensitive. 4 colonies derived from each 
recipient strain were transferred three consecutive tiroes 
through overnight TY cultures at 30 °C, plated on LBPSG at 3 0 

25 °C, and replica plated to LBPSG with erythromycin. Sensitive 
colonies were obtained in each case, and these were subsequent- 
ly checked for kanamycin resistance. One erythromycin, kanamy- 
cin, and spectinomycin sensitive isolate derived from each 
recipient was kept: SJ3634 (from SJ3626) , SJ3635 (from SJ3627) , 

30 SJ3636 (from SJ3628), and SJ3637 (from SJ3629). 
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Strains from both 1. and 2. transposition round were inoculated 
in BPX shake flasks and incubated with shaking for 6 days at 37 
°C. Alpha-amylase activity was determined using the Phadebas 
Amylase Test from Kabi Pharmacia. The results are given here in 
5 relative, arbitrary units: 



Units/ml 

B. subtilis DN1885 host strain 0.4 
(in triplicate) °- 26 



0.25 



10 SJ3549 (1. transposition, Spc R ) 
(in duplicate) 

SJ3559 (1. transposition, Spc s ) 
(in triplicate) 



2.45 
2.42 

2.4 

2.16 

2.35 



15 SJ3626 ) 

SJ3627 ) (2. transposition, 

SJ3628 ) Spc R ) 

SJ3629 ) 



3.28 
3.6 
3.25 
3.14 



SJ3634 ) 

20 SJ3635 ) (2. transposition, 

SJ3636 ) Spc s ) 

SJ3637 ) 



3.21 
3.3 
3.38 
3.1 



It is evident that a certain production of alpha-amylase is 
obtained from the 1. transposition strain, this level is unaf- 
25 fected by the subsequent deletion of the spectinomycin re- 
sistance gene mediated by resolvase, the level is increased in 
the 2. transposition strains, and this increased level is main- 
tained upon deletion of the spectinomycin resistance gene me- 
diated by resolvase. 
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In a similar experiment, strains SJ3599-3601 were used as 
recipients to recieve the resolvase expressing plasmid by 
conjugation from strain SJ3309. Transconjugants were selected 
on LBPSG with kanamycin (10 Mg/ml) . Colonies from these plates 
5 were replicated to kanamycin (10 Mg/ml) , spectinomycin (60 
/ig/ml) , and tetracycline (5 ng/ml) . From each recipient, 
kanamycin resistant but spectinomycin and tetracycline sensiti- 
ve colonies could be isolated. 

The resolvase expressing plasmid, conferring resistance to both 
o kanamycin and erythromycin, could subsequently be cured from 
the cells by a few successive transfers of strains in liquid TY 
medium at 30 °C (without antibiotics), and plasmid free cells 
isolated by replica plating to plates with erythromycin (5 
Mg/ml) . one such erythromycin sensitive strain isolated from a 
5 kanamycin resistant transconjugant of SJ3600 was kept as SJ36- 
40. 



EXAMPLE 23 

Introduction of the transposon del ivery vector of example 18 
into the marker free str ains of example 13. and isolation of 
20 strains jn which transpo sit i o n has occured 

In this and the following example, the tools developed in 
previous examples are combined to create a Bacillus subtilis 
strain, free of antibiotic resistance markers, containing two 
copies of the Savinase* gene and one copy of the B. lichen if or- 
25 mis amylase gene in its genome. 

Strains SJ3565 and SJ3566, each containing two copies of the 
Savinase* gene, were used as recipients in a conjugation with 
strain SJ3547. Transconjugants were selected on LBPSG with 
spectinomycin (60 ng/ml) and erythromycin (5 Mg/ml) . All were 
30 amylase-positive and protease-positive. About half were 
tetracycline sensitive, indicating absence of pBC16. Two 
tetracycline sensitive colonies derived from each recipient 
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were kept: SJ3595 and SJ3596, from SJ3565, and SJ3597 and 
SJ3598, from SJ3566. 

These strains were incubated with shaking in TY with spec- 
tinomycin (60 iig/nl) at 30 °C overnight, diluted 100 times into 

5 a new TY culture with spectinomycin (60 Mg/ml) , incubated with 
shaking for 6 hours at 50 °C, and plated on LBPSG with spec- 
tinomycin (60 ng/ml) at 50 °C. These plates were subsequently 
replicated to plates with either spectinomycin (60 iig/ml) or 
erythromycin (5 tig/ml) . No erythromycin sensitive colonies were 

10 obtained from SJ3598, whereas about 90 % of the colonies from 
the other three strains were spectinomycin resistant, but 
erythromycin sensitive. One such colony from each strain was 
kept: SJ3602 (from SJ3595) , SJ3603 (from SJ3596) , and SJ3604 
(from SJ3597) . 

15 EXAMPLE 24 

Resolvase mediated marker deletion from the strains of example 
21 

The resolvase expressing plasmid was transferred from strain 
SJ3309 into each of strains SJ3602-SJ3604 by conjugation as 

20 previously described. Transconjugants were selected on LBPSG 
plates with kanamycin (10 ng/ml) at 30 °C, and replica plated 
to plates with either kanamycin (10 tig/nl) , spectinomycin (60 
Mg/ml), or tetracycline (5 itg/ml) . Spectinomycin and tetra- 
cycline sensitive colonies were obtained derived from each 

25 recipient strain. 

The resolvase expressing plasmid, conferring resistance to both 
kanamycin and erythromycin, could subsequently be cured from 
the cells by a few successive transfers of strains in liquid TY 
medium at 30 °C (without antibiotics) , and plasmid free cells 
30 isolated by replica plating to plates with erythromycin (5 
Mg/ml) . One such erythromycin sensitive strain isolated from a 
kanamycin resistant transconjugant from each of strains SJ3602- 
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3604 were kept as SJ3641 (from SJ3602) , SJ3642 (from SJ3603) 
and SJ3643 (from SJ3604). 



EXAMPLE 25 

Construction and use of a transposon donor plasmid containing 
5 the transposase genet together with a marker gene, flanked by 
res sequences . 

The purpose of this example is to create and use a transposon 
donor plasmid, in which transposase and marker together resides 
between res sequences. This plasmid is then used to isolate 

io strains where transposition has taken place , and the donor 
plasmid has been lost. The transposed fragment is able to 
undergo further transpositions, which can be selected for by 
increased antibiotic resistance. When desired strains are 
obtained, resistance genes and transposases can be deleted 

15 using resolvase, as described in previous examples. 

Previously constructed res -marker- res cassettes (examples 16 
and 17) contain a unique Mlul site, into which transposase can 
be inserted. 

(1) Transposase is PCR amplified from pHVl248, using as 

20 primers 

BamHI Mlul <-pHV1248 125-142-> 
Tnasel : 5 1 -TGACGGATCCACGCGTGGCGCACTCCCGTTCTGG-3 1 
and 

BamHI Mlul <-pHV1248 1550-1531-> 
25 Tnase2 : 5 » -GTACGGATCCACGCGTAAAGGCACCTTTGGTCACGG-3 • 

The PCR fragment is gel purified, digested with BamHI, and 
cloned into pUC19 digested with BamHI. Some clones are DNA 
sequenced, and a correct one used in further work. 
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(2) A spectinomycin resistance cassette with transposase 
is constructed by cloning of the Mlul fragment from (1) , above, 
into pSJ3444. 

(3) A kanamycin resistance cassette with transposase is 
5 constructed in two steps: i) The Hindlll fragment from pSJ3586 

is cloned into pUC19. ii) The Mlul fragment from (1) is cloned 
into i). 

The Hindlll fragments can then be cloned into pSJ3476 or 
pSJ3282. The resulting plasmid will have two transposase genes, 
10 and clones with these genes in antiparallel orientation can be 
used as transposon delivery vectors, into which e.g. JL_ 
licheniformis amylase or Savinase* genes may be inserted. 

Vector plasmids with only one transposase gene may be prepared 
as follows: 

15 (4) The Nhel-Xbal fragment from pSJ3475, containing the coli- 
vector part of this plasmid, and the transposase, is deleted. 
This plasmid is then used as vector for cloning of the Hindlll 
fragments containing the res-marker-transposase-res segments 
described above. 

20 (5) The Scal-Hindlll fragment from pSJ3282 is replaced by 

the Scal-Hindlll fragment from pUC18 (this deletes the entire 
transposon part) . Subsequently this new plasmid is used for 
cloning of Hindlll fragments, as above. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(1) APPLICANT: 

A) NAME: Novo Nordisk A/S 

5 (B) STREET: Novo Alle 

C) CITY: DK-2880 Bagsvaerd 

E) COUNTRY: Denmark 

F) POSTAL CODE (ZIP): DK-2880 
6) TELEPHONE: +45 44 44 88 88 

w (H) TELEFAX: +45 44 49 32 56 

(I) TELEX: 37304 

(11) TITLE OF INVENTION: TITLE 

(111) NUMBER OF SEQUENCES: 36 

15 (1v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10216 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 
zs (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "pM0L553" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TTCTCATGTT TGACAGCTTA TCATCGACTG CACGGTGCAC CAATGCTTCT GGCGTCAGGC 60 

30 AGCCATCGGA AGCTGT6GTA TGGCTGTGCA GGTCGTAAAT CACTGCATAA TTC6TGTCGC 120 

TCAAGGCGCA CTCCCGTTCT GGATAATGTT TTTTGCGCCG ACATCATAAC GGTTCTGGCA 180 

AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGGCTC GTATAATGTG TGGAATTGTG 240 

AGCGGATAAC AATTTCACAC AGGAAACAGG ATCAAATGGT TTCGAATTCA TTAAAGAGGA 300 

6AAATTAACT ATGTGCGAAC TCGATATTTT ACACGACTCT CTTTACCAAT TCTGCCCCGA 360 

35 ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC CACGCATTAC TTGACTGTAA 420 

AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC AAAGCGAGAA CAAAACATAA 480 
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CATCAAAC6A ATC6ACC6AT T6TTA6GTAA TCGTCACCTC CACAAA6A6C 6ACTC6CTGT 540 

ATACCGTTG6 CAT6CTAGCT TTATCTGTTC GC6CAATACG AT6CCCATTG TACTTGTT6A 600 

CT6GTCTGAT ATTC6TGAGC AAAAACGACT TATGGTATT6 CGAGCTTCAG TCGCACTACA 660 

C6GTCGTTCT GTTACTCTTT ATGAGAAAGC GTTCCCGCTT TCAGAGCAAT GTTCAAAGAA 720 

5 AGCTCAT6AC CAATTTCTA6 CC6ACCTT6C 6AGCATTCTA CC6AGTAACA CCACACC6CT 780 

CATTGTCA6T GATGCT66CT TTAAAGT6CC ATGGTATAAA TCCGTT6A6A AGCTG6GTTG 840 

6TACTG6TTA A6TC6AGTAA GA66AAAAGT ACAATATGCA GACCTAGGAG CGGAAAACTG 900 

GAAACCTATC AGCAACTTAC AT6ATAT6TC ATCTAGTCAC TCAAA6ACTT TAGGCTATAA 960 

6A66CTGACT AAAAGCAATC CAATCTCAT6 CCAAATTCTA TT6TATAAAT CTCGCTCTAA 1020 

10 A66CC6AAAA AATCAGCGCT C6ACAC6GAC TCATTGTCAC CACCC6TCAC CTAAAATCTA 1080 

CTCA6CGTC6 GCAAA66AGC CATG6TTCTA 6CAACTAACT TACCTGTT6A AATTC6AACA 1140 

CCCAAACAAC TTGTTAATAT CTATTCGAAG CGAATGCA6A TTGAA6AAAC CTTCC6A6AC 1200 

TT6AAAAGTC CTGCCTAC6G ACTAG6CCTA CGCCATA6CC GAAC6AGCAG CTCAGAGCGT 1260 

TTT6ATATCA TGCT6CTAAT CGCCCTGATG CTTCAACTAA CATGTT6GCT TGCGG6CGTT 1320 

15 CATGCTCAGA AACAAGGTTG GGACAAGCAC TTCCAGGCTA ACACAGTCAG AAATCGAAAC 1380 

6TACTCTCAA CAGTTCGCTT AG6CATGGAA GTTTTGCGGC ATTCTGGCTA CACAATAACA 1440 

AGG6AAGACT TACTC6T6GC T6CAACCCTA CTAGCTCAAA ATTTATTCAC ACATGGTTAC 1500 

6CTTTGGG6A AATTATGAGG G6ATCTTCGA CCGTGACCAA A6GT6CCTTT TATCATCACT 1560 

TTAAAAATAA AAAACAATTA CTCAGTGCCT GTTATAAGCA 6CAATTAATT AT6ATTGATG 1620 

20 CCTACATCAC AACAAAAACT 6ATTTAACAA ATG6TT6GTC TGCCTTAGAA AGTATATTTG 1680 

AACATTATCT TGATTATATT ATTGATAATA ATAAAAACCT TATCCCTATC CAA6AAGTGA 1740 

T6CCTATCAT TGGTTGGAAT GAACTTGAAA AAATTAGCCT T6AATACATT ACTGGTAAGG 1800 

TAAACGCCAT TGTCAGCAAA TTGATCCAAG AGAACCAACT TAAAGCTTTC CTGACGGAAT 1860 

GTTAATTCTC GTTGACCCTG AGCACTGATG AATCCCCTAA TGATTTTGGT AAAAATCATT 1920 

25 AA6TTAAGGT 6GATACACAT CTTGTCATAT GATCCC6GAT CTG6GCAATA 6TTACCCTTA 1980 

TTATCAAGAT AAGAAAGAAA AGGATTTTTC GCTACGCTCA AATCCTTTAA AAAAACACAA 2040 

AA6ACCACAT TTTTTAATGT GGTCTTTATT CTTCAACTAA AGCACCCATT AGTTCAACAA 2100 
ACGAAAATTG 6ATCCAAGGG GTGATCGGTC GGCGGAAATG AAGGCCTGCG GCGAGTGCGG 2160 
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GCCTTCTGTT 


TTGA66ATTA 


TAATCAGAGT 


ATATTGAAAG TTTCGCGATC 


TTTTCGTATA 


2220 


ATTGTTTTAG 


GCATA6T6CA 


ATC6ATTGTT 


TGAGAAAAGA A6AAGACCAT 


AAAAATACCT 


2280 


TGTCTGTCAT 


CAGACAGGGT 


ArTTTTTATG 


CTGTCCAGAC TGTCCGCTGT 


GTAAAAATAA 


2340 


GGAATAAAGG 


6GGGTTGTTA 


TTATTTTACT 


6ATATGTAAA ATATAATTTG 


TATAAGAAAA 


2400 


TGA6AGG6AG 


AGGAAACAT6 


ATTCAAAAAC 


6AAAGCGGAC AGTTTCGTTC 


AGACTT6T6C 


2460 


TTATGT6CAC 


6CTGTTATTT 


GTCAGTTT6C 


C6ATTACAAA AACATCAGCC 


GTAAATGGCA 


2520 


CGCTGAT6CA 


6TATTTTGAA 


T66TATACGC 


C6AACGACG6 CCAGCATT6G 


AAACGATTGC 


2580 


AGAATGATGC 


GGAACATTTA 


TC6GATTAAC 


TTAAC6TTAA TATTTGTTTC 


CCAATAGGCA 


2640 


AATCTTTCTA 


ACTTT6ATAC 


6TTTAAACTA 


CCAGCTTGGA CAAGTTGGTA 


TAAAAATGAG 


2700 


GAGGGAAACC 


GAATGAAGAA 


ACCGTTGGGG 


AAAATTGTCG CAAGCACCGC 


ACTACTCATT 


2760 


TCTGTTGCTT 


TTAGTTCATC 


GATC6CATCG 


GCTGCTGAAG AAGCAAAAGA 


AAAATATTTA 


2820 


ATTG6CTTTA 


ATGAGCAGGA 


A6CTGTCA6T 


6AGTTTGTAG AACAAGTAGA 


GGCAAATGAC 


2880 


GAGGTCGCCA 


TTCTCTCTGA 


GGAAGAGGAA 


GTCGAAATTG AATTGCTTCA 


TGAATTTGAA 


2940 


ACGATTCCTG 


TTTTATCC6T 


TGAGTTAAGC 


CCAGAAGATG TGGACGCGCT 


TGAACTC6AT 


3000 


CCAGCGATTT 


CTTATATTGA 


AGAGGATGCA 


GAAGTAACGA CAAT6GCGCA 


ATC6GTACCA 


3060 


TGG6GAATTA 


GCCGTGTGCA 


AGCCCCAGCT 


GCCCATAACC GTGGATT6AC 


AGGTTCTGGT 


3120 


GTAAAAGTTG 


CTGTCCTC6A 


TACAG6GATA 


TCCACTCATC CAGATCTAAA 


TATTCGTGGT 


3180 


GGCGCAAGCT 


TTGTACCA6G 


6GAACC6TC6 


ACTCAAGATG 6GAATGG6CA 


TGGCACGCAT 


3240 


6TG6CCGGGA 


C6ATCGCTGC 


TTTAAACAAT 


TCGATTGGCG TTCTTGGCGT 


AGCGCCGAGC 


3300 


6CTGAGCTAT 


ACGCTGTTAA 


AGTCCTAGG6 


GCGAGCGGTT CAGGTTCGGT 


CAGCTCGATT 


3360 


6CCCAAGGAT 


T66AAT6GGC 


AGGGAACAAT 


GGCATGCACG TTGCTAATTT 


GAGTTTAGGA 


3420 


AGCCCTTCGC 


CAAGTGCCAC 


ACTCGAGCAA 


GCTGTTAATA GCGCGACTTC 


TAGAGGCGTT 


3480 


PTTCTTrTAP 
LI 1 la 1 1 b 1 Au 




pa attpappt 


ppap/:ptpaa tpapptatpp 


PPPPPPPTAT 


oohu 


GCGAACGCAA 


TGGCAGTCGG 


AGCTACTGAT 


CAAAACAACA ACCGCGCTAG 


CTTTTCACAG 


3600 


TATGGCGCAG 


GCCTTGACAT 


TGTCGCACCC 


GGGGTAAACG TGCAGAGCAC 


ATACCCAGGT 


3660 


TCAACATATG 


CCAGCTTAAA 


CGGTACATCG 


ATGGCTACTC CTCATGTTGC 


AGGTGCGGCC 


3720 


GCCCTTGTTA 


AACAAAAGAA 


CCCATCTTGG 


TCTAATGTAC AAATTCGAAA 


TCATCTAAAG 


3780 


AATACGGCAA 


CTAGTTTAGG 


AAGCACGAAC 


TTGTATGGAA GCGGACTTGT 


TAACGCAGAA 


3840 
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GCS6CAAC6C GTTAATCAAT AAAAAAACGC TGTGCGGTTA AAGGGCACAG CG 1 1 1 1 1 1 1 G 3900 

TGTAT66ATC CA6T66GATA TTTTTAAAAT ATATATTTAT GTTACAGTAA TATT6ACTTT 3960 

TAAAAAA6GA TT6ATTCTAA T6AAGAAA6C A6ACAAGTAA 6CCTCCTAAA TTCACTTTAG 4020 

ATAAAAATTT A66A6GCATA TCAAATGAAC TTTAATAAAA TTGATTTAGA CAATT6GAAG 4080 

AGAAAAGAGA TATTTAATCA TTATTT6AAC CAACAAACGA CTTTTAGTAT AACCACAGAA 4140 

ATTGATATTA GTGTTTTATA CC6AAACATA AAACAA6AA6 GATATAAATT TTACCCT6CA 4200 

TTTATTTTCT TAGT6ACAAG 6GTGATAAAC TCAAATACAG CTTTTAGAAC TGGTTACAAT 4260 

AGCGAC66AG AGTTAG6TTA TTGGGATAAG TTAGAGCCAC TTTATACAAT TTTT6ATG6T 4320 

6TATCTAAAA CATTCTCT6G TATTTGGACT CCTGTAAAGA ATGACTTCAA A6AGTTTTAT 4380 

6ATTTATACC TTTCT6AT6T A6A6AAATAT AAT6GTTC6G 6GAAATTGTT TCCCAAAACA 4440 

CCTATACCTG AAAATGCTTT TTCTCTTTCT ATTATTCCAT GGACTTCATT TACT6GGTTT 4500 

AACTTAAATA TCAATAATAA TA6TAATTAC CTTCTACCCA TTATTACAGC AGGAAAATTC 4560 

ATTAATAAAG 6TAATTCAAT ATATTTACCG CTATCTTTAC AGGTACATCA TTCTGTTT6T 4620 

6AT6GTTATC ATGCAG6ATT GTTTATGAAC TCTATTCAG6 AATTGTCAGA TA66CCTAAT 4680 

6ACTG6CTTT TATAATATGA 6ATAATGCC6 ACTGTACTTT TTACAGTCGG TTTTCTAAT6 4740 

TCACTAACCT GCCCCGTTAG TTGAAGAAGG TTTTTATATT ACAGCTCCAG ATCCGGGATC 4800 

ATAT6ACAAG ATGTGTATCC ACCTTAACTT AATGATTTTT ACCAAAATCA TTAGGGGATT 4860 

CATCAGTGCT CAGGGTCAAC GAGAATTAAC ATTCCGTCA6 GAAAGCTTAG CTTATGATGA 4920 

T6ATGTGCTT AAAAACTTAC TCAATGGCTG GTTTAT6CAT ATCGCAATAC AT6CGAAAAA 4980 

CCTAAAAGAG CTTGCC6ATA AAAAAGGCCA ATTTATTGCT ATTTACCGCG 6CTTTTTATT 5040 

GAGCTTGAAA 6ATAAATAAA ATAGATAGGT TTTATTTGAA GCTAAATCTT CTTTATCGTA 5100 

AAAAATGCCC TCTTGGGTTA TCAAGAGGGT CATTATATTT CGC6GAATAA CATCATTT6G 5160 

TGACGAAATA ACTAAGCACT TGTCTCCT6T TTACTCCCCT GAGCTTGAGG G6TTAACATG 5220 

AAGGTCATCG ATAGAAAGCG TGAGAAACAG CGTACAGACG ATTTAGAGAT GTAGAGGTAC 5280 

TTTTAT6CCG AGAAAACTTT TT6CGTGTGA CAGTCCTTAA AATATACTTA GAGCGTAAGC 5340 

6AAAGTAGTA 6C6ACAGCTA TTAACTTTCG GTT6CAAA6C TCTAGGATTT TTAATGGACG 5400 

CAGCGCATCA CACGCAAAAA GGAAATTGGA ATAAATGCGA AATTT6A6AT GTTAATTAAA 5460 

6ACCTTTTT6 AGGTCTTTTT TTCTTAGATT TTTGGGGTTA TTTAGGGGAG AAAACATAGG 5520 
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G666TACTAC 


GACCTCCCCC 


CTA6GTGTCC 


ATTGTCCATT 


GTCCAAACAA 


ATAAATAAAT 


5580 


ATT66GTTTT 


TAATGTTAAA 


AGGTTGTTTT 


TTATGTTAAA 


GTGAAAAAAA 


CA6ATGTTGG 


5640 


6A6GTACAGT 


GATAGTTGTA 


GATAGAAAAG 


AAGAGAAAAA 


AGTTGCTGTT 


ACTTTAAGAC 


5700 


TTACAACA6A 


AGAAAATGAG 


ATATTAAATA 


GAATCAAAGA 


AAAATATAAT 


ATTAGCAAAT 


5760 


5 CA6AT6CAAC 


CGGTATTCTA 


ATAAAAAAAT 


ATGCAAA66A 


66AATACG6T 


6CATTTTAAA 


5820 


CAAAAAAAGA 


TAGACAGCAC 


TGGCAT6CT6 


CCTATCTATG 


ACTAAATTTT 


GTTAAGTGTA 


5880 


TTA6CACCGT 


TATTATATCA 


TGAGCGAAAA 


TGTAATAAAA 


GAAACTGAAA 


ACAAGAAAAA 


5940 


TTCAAGAGGA 


CGTAATTGGA 


CATTTGTTTT 


ATATCCAGAA 


TCAGCAAAAG 


CCGAGTGGTT 


6000 


AGAGTATTTA 


AAAGAGTTAC 


ACATTCAATT 


T6TAGTGTCT 


CCATTACATG 


ATAGGGATAC 


6060 


10 T6ATACAGAA 


GGTAGGATGA 


AAAAAGAGCA 


TTATCATATT 


CTAGTGATGT 


ATGAGGGTAA 


6120 


TAAATCTTAT 


GAACAGATAA 


AAATAATTAA 


CAGAAGAATT 


GAAT6C6ACT 


ATTCCGCAGA 


6180 


TTGCAGGAAG 


TGTGAAAGGT 


CTTGTGA6AT 


ATATGCTTCA 


CATGGACGAT 


CCTAATAAAT 


6240 


TTAAATATCA 


AAAAGAAGAT 


AT6ATAGTTT 


ATGGCGGTGT 


AGATGTTGAT 


GAATTATTAA 


6300 


AGAAAACAAC 


AACAGATAGA 


TATAAATTAA 


TTAAAGAAAT 


6ATTGAGTTT 


ATT6ATGAAC 


6360 


is AAGGAATCGT 


AGAATTTAAG 


AGTTTAATGG 


ATTATGCAAT 


GAAGTTTAAA 


TTTGAT6ATT 


6420 


G6TTCCCGCT 


TTTAT6TGAT 


AACTCGGCGT 


AT6TTATTCA 


AGAATATATA 


AAATCAAATC 


6480 


G6TATAAATC 


TGACCGATAG 


ATTTTGAATT 


TAGGTGTCAC 


AAGACACTCT 


TTTTTCGCAC 


6540 


CA6CGAAAAC 


T6GTTTAAGC 


CGACTGC6CA 


AAAGACATAA 


TCGGGAATTC 


CCGATTCACA 


6600 


AAAAATAGGC 


ACACGAAAAA 


CAAGTTAAGG 


6ATGCAGTTT 


ATGCATCCCT 


TAACTTACTT 


6660 


20 ATTAAATAAT 


TTATAGCTAT 


TGAAAAGAGA 


TAAGAATTGT 


TCAAAGCTAA 


TATTGTTTAA 


6720 


ATCGTCAATT 


CCTGCATGTT 


TTAAGGAATT 


GTTAAATTGA 


TTTTTTGTAA 


ATATTTTCTT 


6780 


6TATTCTTTG 


TTAACCCATT 


TCATAACGAA 


ATAATTATAC 


TTTTGTTTAT 


CTTTGTGTGA 


6840 


TATTTTTflAT 
1 A 1 1 L 1 1 b A 1 


TTTTTTCT A P 
II 1 II 1 U 1 AL 


TTAATPTC/1T 
1 1 AA 1 L 1 bA 1 


AAb 1 bAbl 1 A 


1 1 (/AL III Au 


bill Abb A 1 b 


03UU 


AAAATATTCT 


CTTGGAACCA 


TACTTAATAT 


AGAAATATCA 


ACTTCTGCCA 


TTAAAAGTAA 


6960 


25 TGCCAATGAG 


CGTTTTGTAT 


TTAATAATCT 


TTTAGCAAAC 


CCGTATTCCA 


CGATTAAATA 


7020 


AATCTCATTA 


GCTATACTAT 


CAAAAACAAT 


TTTGCGTATT 


ATATCCGTAC 


TTATGTTATA 


7080 


AGGTATATTA 


CCATATATTT 


TATAGGATTG 


6TTTTTAGGA 


AATTTAAACT 


GCAATATATC 


7140 


CTTGTTTAAA 


ACTT6GAAAT 


TATCGTGATC 


AACAAGTTTA 


TTTTCTGTAG 


TTTTGCATAA 


7200 
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TTTAT66TCT ATTTCAAT66 CA6TTACGAA ATTACACCTC TTTACTAATT CAAGGGTAAA 7260 

AT6GCCTTTT CCT6AGCC6A TTTCAAA6AT ATTATCATGT TCATTTAATC TTATATTTGT 7320 

CATTATTTTA TCTATATTAT GTTTT6AA6T AATAAA6TTT TGACTGTGTT TTATATTTTT 7380 

CTCGTTCATT ATAACCCTCT TTAATTT6GT TATATGAATT TTGCTTATTA AC6ATTCATT 7440 

ATAACCACTT ATTTTTT6TT TGGTT6ATAA T6AACTGT6C TGATTACAAA AATACTAAAA 7500 

AT6CCCATAT TTTTTCCTCC TTATAAAATT AGTATAATTA TA6CAC6AGC TCT6ATAAAT 7560 

ATGAACATGA TGAGTGATCG TTAAATTTAT ACTGCAATC6 6ATGCGATTA TT6AATAAAA 7620 

GATATGA6AG ATTTATCTAA TTTC 1 1 1 1 1 1 CTT6TAAAAA AAGAAAGTTC TTAAAGGTTT 7680 

TATA6TTTTG GTCGTAGAGC ACACGGTTTA AC6ACTTAAT TACGAAGTAA ATAA6TCTAG 7740 

T6T6TTAGAC TTTATGAAAT CTATATAC6T TTATATATAT TTATTATCC6 6AGGTGTAGC 7800 

ATGTCTCATT CAATTTT6AG GGTTGCCA6A GTTAAAGGAT CAA6TAATAC AAAC666ATA 7860 

CAAAGACATA ATCAAAGAGA GAATAAAAAC TATAATAATA AAGACATAAA TCATGAGGAA 7920 

ACATATAAAA ATTAT6ATTT GATTAACGCA CAAAATATAA AGTATAAAGA TAAAATT6AT 7980 

GAAACGATTG ATGAGAATTA TTCAGGGAAA CGTAAAATTC GGTCAGATGC AATTC6AC6A 8040 

TAA6CTAGCT TTAATGC66T AGTTTATCAC AGTTAAATT6 CTAACGCAGT CAGGCACCGT 8100 

6TAT6AAATC TAACAATGCG CTCATCGTCA TCCTCGGCAC CGTCACCCTG GATGCTGTAG 8160 

GCATAGGCTT GGTTATGCCG GTACTGCC66 6CCTCTTGC6 GGAT6CTCTT CCGCTTCCTC 8220 

6CTCACT6AC TCGCTGCGCT CGGTCGTTC6 GCTGCGGCGA 6C6GTATCAG CTCACTCAAA 8280 

66CG6TAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA 8340 

AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 8400 

CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC GAAACCCGAC 8460 

AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT CTCCT6TTCC 8520 

GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC 8580 

TCAATGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG 8640 

T6TGCACGAA CCCCCCGTTC A6CCCGACC6 CTGCGCCTTA TCCGGTAACT ATCGTCTT6A 8700 

6TCCAACCC6 GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA ACAGGATTAG 8760 

CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA ACTACGGCTA 8820 

CACTAGAAGG ACA6TATTTG 6TATCTGCGC TCTGCTGAAG CCAGTTACCT TCGGAAAAAG 8880 
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AGTT6GTAGC 


TCTTGATCCG 


6CAAACAAAC CACC6CTGGT AGCGGTG6TT TTTTTGTTT6 


8940 


CAAGCAGCAG 


ATTACGCGCA 


GAAAAAAAGG ATCTCAA6AA GATCCTTTGA TCTTTTCTAC 


9000 


66G6TCTGAC 


6CTCA6TGGA 


AC6AAAACTC ACGTTAAGGG ATTTTG6TCA T6AGATTATC 


9060 


AAAAAGGATC 


TTCACCTAGA 


TCCTTTTAAA TTAAAAAT6A AGTTTTAAAT CAATCTAAAG 


9120 


5 TATATATGAG 


TAAACTT66T 


CTGACAGTTA CCAATGCTTA ATCAGTGAG6 CACCTATCTC 


9180 


AGCGATCT6T 


ctatttc6tt 


CATCCATAGT TGCCTGACTC CCC6TCGTGT AGATAACTAC 


9240 


GATACGGGAG 


GGCTTACCAT 


CT6GCCCCA6 T6CTGCAATG ATACCGC6AG ACCCACGCTC 


9300 


ACC6GCTCCA 


GATTTATCAG 


CAATAAACCA 6CCAGCCGGA A66GCCGAGC GCAGAAGTGG 


9360 


TCCTGCAACT 


TTATCC6CCT 


CCATCCAGTC TATTAATTGT TGCCGGGAAG CTAGAGTAAG 


9420 


io TAGTTCGCCA 


GTTAATAGTT 


TGCGCAAC6T TGTT6CCATT 6CTGCAG6CA TCGTGGTGTC 


9480 


ACGCTCGTC6 


TTT6GTAT66 


CTTCATTCA6 CTCCGGnCC CAACGATCAA GGCGAGTTAC 


9540 


ATGATCCCCC 


ATGTTGTGCA 


AAAAA6CGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCA6 


9600 


AAGTAAGTTG 


GCCGCAGTGT 


TATCACTCAT 6GTTATGGCA GCACTGCATA ATTCTCTTAC 


9660 


T6TCAT6CCA 


TCCGTAAGAT 


GCTTTTCTGT GACTGGTGAG TACTCAACCA AGTCATTCTG 


9720 


15 AGAATAGTGT 


ATGCGGCGAC 


CGAGTTGCTC TTGCCCGGCG TCAACACGGG ATAATACC6C 


9780 


GCCACATA6C 


AGAACTTTAA 


AAGTGCTCAT CATTGGAAAA CGTTCTTCG6 GGCGAAAACT 


9840 


CTCAAGGATC 


TTACCGCTGT 


TGAGATCCAG TTCGAT6TAA CCCACTCGTG CACCCAACTG 


9900 


ATCTTCAGCA 


TCTTTTACTT 


TCAfTAGPGT TTPTCnRTRA GPAAAAAPAft £AA££PA AAA 
I unuV/MUtu i i 1 1/ 1 quo 1 \2f\ bwAMMHMUnvi QHAuut AMAA 




TGCCGCAAAA 


AAGGGAATAA 


6GGC6ACACG 6AAATGTTGA ATACTCATAC TCTTCCTTTT 


10020 


20 TCAATATTAT 


TGAA6CATTT 


ATCAGGGTTA TTGTCTCATG AGCGGATACA TATTTGAAT6 


10080 


TATTTAGAAA 


AATAAACAAA 


TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCT6A 


10140 


CGTCTAAGAA 


ACCATTATTA 


TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC 


10200 


CTTTCGTCTT 


CAAGAA 




10216 



(2) INFORMATION FOR SEQ ID NO: 2: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "pSJ2739" 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: pSJ2739 



s (Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TCTG6ACT6T CCAAACAT66 TTTAA6CC6C TTGCTTACGC TTTTATTCTC ACAAGGGAAT 60 

CT6GATACCC TCAGGTTTTC TACG6GGATA TGTACGGGAC GAAAGGAGAC TCCCAGCGC6 120 

AAATTCCTGC CTTGAAACAC AAAATTGAAC CGATCTTAAA AGCGAGAAAA CAGTAT6CGT 180 

AC66AGCACA 6CAT6ATTAT TTC6ACCACC AT6ACATTGT CGGCTGGACA AGGGAAGGCG 240 

10 ACAGCTCGGT TGCAAATTCA 6GTTT6GCG6 CATTAATAAC AGAC6GACCC GGTGGGGCAA 300 

A6C6AATGTA T6TCGGCCGG CAAAACGCC6 GTGAGACATG 6CATGACATT ACCGGAAACC 360 

GTTC6GAGCC GGTTGTCATC AATTCGGAAG GCTGGGGAGA GTTTCACGTA AACGGCGGGT 420 

CGGTTTCAAT TTAT6TTCAA A6ATAGAA6A GCAGAGAGGA CGGATTTCCT 6AA6GAAATC 480 

C6 l 1 1 1 1 1 1 A TTTT6CCCGT CTTATAAATT TCGTTGATTA CATTTTATAA TTAATTTTAA 540 

15 CAAA6TGTCA TAA6CCCTCA GGAATATTGC T6ACAGTTTA 6AATCCCTA6 GTAAGGC66G 600 

GATGAAATGG CAACGTTATC T6ATGTAGCA AAGAAAGAAA TGTGTCGAAA ATGACG6TAT 660 

C6C666TGAT CAATCATCCT GAGACTGTGA CG6AT6AATT 6AAAAAGCTT 6CATGCCT6C 720 

AGGTCGATTC ACAAAAAATA GGCACACGAA AAACAAGTTA AGGGATGCAG TTTATGCATC 780 

CCTTAACTTA CTTATTAAAT AATTTATAGC TATTGAAAAG AGATAAGAAT TGTTCAAAGC 840 

20 TAATATTGTT TAAATCGTCA ATTCCTGCAT GTTTTAAGGA ATTGTTAAAT TGATTTTTTG 900 

TAAATATTTT CTTGTATTCT TTGTTAACCC ATTTCATAAC GAAATAATTA TACTTTTGTT 960 

TATCTTTGTG TGATATTCTT GATTTTTTTC TACTTAATCT GATAAGTGAG CTATTCACTT 1020 

TA6GTTTAGG ATGAAAATAT TCTCTTGGAA CCATACTTAA TATAGAAATA TCAACTTCTG 1080 

CCATTAAAAG TAATGCCAAT GAGCGTTTTG TATTTAATAA TCTTTTAGCA AACCCGTATT 1140 

25 CCAC6ATTAA ATAAATCTCA TTAGCTATAC TATCAAAAAC AATTTTGCGT ATTATATCC6 1200 

TACTTAT6TT ATAAGGTATA TTACCATATA TTTTATAGGA TTGGTTTTTA GGAAATTTAA 1260 

ACTGCAATAT ATCCTTGTTT AAAACTTGGA AATTATCGT6 ATCAACAAGT TTATTTTCTG 1320 

TAGTTTTGCA TAATTTATGG TCTATTTCAA TGGCAGTTAC GAAATTACAC CTCTTTACTA 1380 
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ATTCAAG6GT 


AAAATGGCCT 


TTTCCT6AGC CGATTTCAAA 


GATATTATCA TGTTCATTTA 


1440 


ATCTTATATT 


T6TCATTATT 


nATCTATAT TATGTTTTGA 


AGTAATAAAG TTTT6ACT6T 


1500 


GTTTTATATT 


TTTCTCGTTC 


ATTATAACCC TCTTTAATTT 


GGTTATATGA ATTTTGCTTA 


1560 


TTAACGATTC 


ATTATAACCA 


CTTATTTTTT GTTTGGTTGA 


TAAT6AACTG TGCTGATTAC 


1620 


AAAAATACTA 


AAAATGCCCA 


TAIN IMCC TCCTTATAAA 


ATTAGTATAA TTATAGCACG 


1680 


AGCTCTGATA 


AATATGAACA 


TGATGAGT6A TCGTTAAATT 


TATACTGCAA TC6GAT6CGA 


1740 


TTATTGAATA 


AAA6ATATGA 


GAGATTTATC TAATTTCTTT 


TTTCTTGTAA AAAAAGAAAG 


1800 


TTCTTAAAGG 


TTTTATAGTT 


TTGGTCGTAG AGCACACGGT 


TTAACGACTT AATTACGAA6 


1860 


TAAATAAGTC 


TA6TGTGTTA 


GACTTTATGA AATCTATATA 


CGTTTATATA TATTTATTAT 


1920 


CC6GAGGTGT 


AGCAT6TCTC 


ATTCAATTTT 6AGG6TTGCC 


AGAGTTAAAG GATCAAGTAA 


1980 


TACAAACGGG 


ATACAAAGAC 


ATAATCAAAG AGAGAATAAA 


AACTATAATA ATAAAGACAT 


2040 


AAATCATGAG 


GAAACATATA 


AAAATTATGA TTT6ATTAAC 


GCACAAAATA TAAAGTATAA 


2100 


AGATAAAATT 


6ATGAAACGA 


TTGATGAGAA TTATTCAGGG 


AAACGTAAAA TTCGGTCAGA 


2160 


TGCAATTCGA 


CATGT66ACG 


GACTGGTTAC AAGTGATAAA 


GATTTCTTTG ATGATTTAAG 

9mm » ■ ■ m www v ri I w ■ i w Www 9 %• 


2220 


C66AGAAGAA 


ATAGAACGAT 


TTTTTAAAGA TA6CTTGGAG 


TTTCTAGAAA ATGAATACGG 


2280 


TAAGGAAAAT 


ATGCTGTATG 


CGACTGTCCA TCTGGATGAA 


AGAGTCCCAC ATATGCACTT 


2340 


T66TTTTGTC 


CCTTTAACAG 


AGGACGGGAG ATTGTCTGCA 


AAAGAACAGT TAG6CAACAA 


2400 


6AAAGACTTT 


ACTCAATTAC 


AAGATAGATT TAATGAGTAT 


GTGAATGAGA AAGGTTATGA 


2460 


ACTTGAAAGA 


GGCACGTCCA 


AAGAGGTTAC AGAACGAGAA 


CATAAAGCGA TGGATCAGTA 


2520 


CAAGAAAGAT 


ACTGTATTTC 


ATAAACAGGA ACTGCAAGAA 


GTTAAGGATG AGTTACAGAA 


2580 


G6CAAATAAG 


CAGTTACAGA 


GTGGAATAGA GCATATGAGG 


TCTACGAAAC CCTTTGATTA 


2640 


TGAAAATGAG 


CGTACAGGTT 


TGTTCTCTGG ACGTGAAGAG 


ACTGGTAGAA AGATATTAAC 


2700 


IbllbAlbAA 


I TTGAACGCC 


TGCAAGAAAC AATCTCTTCT 


GCAGAACGGA TTGTTGATGA 


r\ ~t r r\ 

2760 


TTACGAAAAT 


ATTAAGAGCA 


CAGACTATTA CACAGAAAAT 


CAAGAATTAA AAAAACGTAG 


2820 


AGAGAGTTTG 


AAAGAAGTAG 


TGAATACAT6 GAAAGAGGGG 


TATCACGAAA AAAGTAAAGA 


2880 


6GTTAATAAA 


TTAAAGCGAG 


AGAATGATAG TTTGAATGAG 


CAGTTGAATG TATCAGAGAA 


2940 


ATTTCAAGCT 


AGTACAGTGA 


CTTTATATCG TGCTGCGAGG 


GCGAATTTCC CTGGGTTTGA 


3000 


GAAAGGGTTT 


AATAGGCTTA 


AAGAGAAATT CTTTAATGAT 


TCCAAATTTG AGCGTGTGGG 


3060 
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ACA6TTTAT6 6ATGTTGTAC A66ATAAT6T CCA6AA66TC 6ATA6AAAGC GT6AGAAACA 3120 

6CGTACAGAC GATTTAGAGA TGTAGAGGTA CTTTTAT6CC GAGAAAACTT TTTGCGTGTG 3180 

ACAGTCCTTA AAATATACTT AGAGCGTAAG CGAAAGTAGT AGCGACAGCT ATTAACTTTC 3240 

66TTTCAAA6 CTCTAG6ATT TTTAATG6AC GCAGCGCATC ACACGCAAAA AGGAAATTGG 3300 

AATAAAT6C6 AAATTT6A6A TGTTAATTAA AGACCTTTTT 6A66TCTTTT TTTCTTA6AT 3360 

TTTTGGGGTT ATTTAGGG6A 6AAAACATAG G6GGGTACTA CGACCTCCCC CCTAG6T6TC 3420 

CATT6TCCAT T6TCCAAACA AATAAATAAA TATTGG6TTT TTAATGTTAA AAGGTTGTTT 3480 

TTTATGTTAA AGTGAAAAAA ACAGATGTTG 6GAGGTACAG TGATGGTTGT AGATAGAAAA 3540 

6AA6A6AAAA AAGTTGCTGT TACTTTAAGA CTTACAACAG AAGAAAATGA GATATTAAAT 3600 

AGAATCAAAG AAAAATATAA TATTAGCAAA TCAGATGCAA CCG6TATTCT AATAAAAAAA 3660 

TATGCAAAGG AGGAATACGG T6CATTTTAA ACAAAAAAAG ATAGACAGCA CT6GCATGCT 3720 

6CCTATCTAT 6ACTAAATTT T6TTAAGT6T ATTAGCACCG TTATTATATC ATGA6CGAAA 3780 

ATGTAATAAA A6AAACT6AA AACAA6AAAA ATTCAAGAG6 ACGTAATTGG ACATTTGTTT 3840 

TATATCCA6A ATCA6CAAAA GCCGAGTGGT TAGAGTATTT AAAAGAGTTA CACATTCAAT 3900 

TTGTAGTGTC TCCATTACAT GATAGGGATA CTGATACAGA AGGTAGGATG AAAAAAGAGC 3960 

ATTATCATAT TCTAGTGATG TAT6AGGGTA ATAAATCTTA T6AACAGATA AAAATAATTA 4020 

CAGAAGAATT GAATGC6ACT ATTCCGCAGA TTGCA6GAAG TGTGAAAGGT CTTGT6AGAT 4080 

ATATGCTTCA CATG6AC6AT CCTAATAAAT TTAAATATCA AAAA6AAGAT AT6ATAGTTT 4140 

ATGGCGGTGT AGAT6TTGAT 6AATTATTAA AGAAAACAAC AACAGATAGA TATAAATTAA 4200 

TTAAAGAAAT GATTGA6TTT ATTGATGAAC AAGGAATC6T AGAATTTAAG AGTTTAATGG 4260 

ATTATGCAAT GAA6TTTAAA TTTGATGATT G6nCCCGCT TTTATGT6AT AACTC6GCGT 4320 

AT6TTATTCA AGAATATATA AAATCAAATC G6TATAAATC TGACCGATAG ATTTT6AATT 4380 

TAGGTGTCAC AAGACACTCT TTTTTC6CAC CAGCGAAAAC TGGTTTAAGC CGACTGCGCA 4440 

AAAGACATAA TCGACTCTAG AGGATCCCCG GGTACCGAGC TCT6CCTTTT AGTCCAGCTG 4500 

ATTTCACTTT TTGCATTCTA CAAACTGCAT AACTCATATG TAAATCGCTC CTTTTTAGGT 4560 

GGCACAAATG TGAGGCATTT TCGCTCTTTC CGGCAACCAC TTCCAAGTAA AGTATAACAC 4620 

ACTATACTTT ATATTCATAA AGTGTGTGCT CTGCGAGGCT GTC6GCAGTG CCGACCAAAA 4680 

CCATAAAACC TTTAAGACCT TTCTTTTTTT TACGAGAAAA AAGAAACAAA AAAACCTGCC 4740 
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CTCT6CCACC TCAGCAAA66 GGGGTTTTGC TCTCGTGCTC GTTTAAAAAT CAGCAAGGGA 4800 
CAGGTAGTAT TTTTTGAGAA GATCACTCAA AAAATCTCCA CCTTTAAACC CTTGCCAATT 4860 
TTTATTTT6T CCGTTTTGTC TAGCTTACCG AAAGCCAGAC TCAGCAAGAA TAAAATTTTT 4920 
ATTGTCTTTC G6TTTTCTAG T6TAAC6GAC AAAACCACTC AAAATAAAAA AGATACAA6A 4980 
5 6AG6TCTCTC GTATCTTTTA TTCAGCAATC GCGCCCGATT GCTGAACAGA TTAATAATGA 5040 
6CTCGAATTC AT6AGATCCA AATGTAAAA6 TTCAAATGAT TC6ACCGAAA AATAAATATA 5100 
AATC6GATAT ACAATCGGCA ATTGACGAAA CTGCAAAATA TCCTGTAAAG GATACGGATT 5160 

TTATGACC6A T6ATGAAGAA AAGAATTTGA AACGTTT6TC TGATTTGGAG GAAGGTTTAC 5220 

ACCGTAAAAG 6TTAATCTCC TATGGTGGTT TGTTAAAAGA AATACATAAA AAATTAAACC 5280 

TTGATGACAC A6AAGAAGGC GATTTGATTC ATACAGAT6A TGAC6AAAAA GCCGATGAAG 5340 

ATG6ATTTT6 TATTATTGCA ATGTGGAATT GGGAACGGAA AAATTATTTT ATTAAAGA6T 5400 

AGTTCAACAA ACG6GCCAGT TTGTTGAAGA TTAGATGCTA TAATTGTTAT TAAAAGGATT 5460 

6AAGGATGCT TAG6AAGAC6 AGTTATTAAT AGCTGAATAA GAACGGTGCT CTCCAAATAT 5520 

TCTTATTTAG AAAAGCAAAT CTAAAATTAT CTGAAAAGGG AATGAGAATA GTGAATGGAC 5580 

CAATAATAAT GACTAGAGAA GAAAGAATGA AGATT6TTCA TGAAATTAAG GAACGAATAT 5640 

TGGATAAATA TGGGGATGAT GTTAAGGCTA TT6GTGTTTA TGGCTCTCTT GGTC6TCAGA 5700 

CT6AT66GCC CTATTCGGAT ATTGAGAT6A TGTGTGTCAT GTCAACAGAG GAAGCAGAGT 5760 

TCAGCCATGA ATGGACAACC GGTGA6TGGA AGGTGGAAGT GAATTTTGAT AGCGAAGAGA 5820 

TTCTACTA6A TTATGCATCT CAGGTGGAAT CAGATTGGCC 6CTTACACAT GGTCAATTTT 5880 

TCTCTATTTT GCCGATTTAT GATTCAGGTG GATACTTAGA GAAAGTGTAT CAAACTGCTA 5940 

AATCGGTAGA AGCCCAAACG TTCCACGATG CGATTTGTGC CCTTATCGTA GAAGAGCTGT 6000 

TTGAATATGC AGGCAAAT6G CGTAATATTC GTGTGCAAGG ACCGACAACA TTTCTACCAT 6060 

CCTTGACTGT ACAGGTAGCA ATGGCAGGTG CCATGTTGAT TGGTCTGCAT CATCGCATCT 6120 

GTTATACGAC GAGCGCTTCG GTCTTAACTG AAGCAGTTAA GCAATCAGA 6169 

(2) INFORMATION FOR SEQ ID NO: 3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - -LWN5037" 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCCACTGGAT CCAATTTTCG TTT6TT6 
5 (2) INFORMATION FOR SEQ ID NO: 4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
to (D) T0P0L06Y: linear 

(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « -LWN5038" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GCAAATTGAT CCAA6A6AAC CAAC 

is (2) INFORMATION FOR SEQ ID NO: 5: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 

B) TYPE: nucleic acid 

C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - a LUN5036 a 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CAACAAACGA AAATTGGATC CAGTGGG 
25 (2) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "LWN5039" 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCACATCATC ATCATAAGC 



19 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic add 

s (C) STRANDEDNESS: single 

(D) T0P0L06Y: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc « "LWN5136" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
io CC66CGGATC CAA6G66T6A TCG 

(2) INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic add 

is (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) HOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN2043" 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
20 GGGGTACTAG TAACCCGGGC CCGGCGTAGA GGATCCATAC ACAAA 45 
(2) INFORMATION FOR SEQ ID NO: 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "LWN5232" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
30 GTCGGAGCTC ATTATTAATC TGTTCAGCAA TCGGGC 36 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic add 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN5233" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTC66A6CTC T6CCTTTTAB TCCAGCT6AT TTCAC . 35 

5 (2) INFORMATION FOR SEQ ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
no (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7839" 



(X1) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GACGGGATCC CTGCAGTATC CAATTTATTT TTTTCTTAAC AAG6 44 
is (2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7840" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GACGGATTCA AAGCTTAAAG CACTTGCATA GGCTAATGCC 40 
25 (2) INFORMATION FOR SEQ ID NO: 13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
so (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc • "LWN7794" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GACGGGTACC ACGCGTTAAT CAATAAAAAA ACGCTGTGC6 GTTAAA 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7790 



■ 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
io TATATATTTT AAAAATATCC CACGGTTCTT CAAATATTTC TCC 43 
(2) INFORMATION FOR SEQ ID NO: 15: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic add 

is (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LHN7789" 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
20 G6A6AAATAT TT6AAGAACC GTG66ATATT TTTAAAATAT ATAT 44 
(2) INFORMATION FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic add 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7788" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
30 CAAGTGTTCG CTTCGCTCTC AC6GAGCTGT AATATAAAAA CCTTC 45 
(2) INFORMATION FOR SEQ ID NO: 17: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7787" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
6AAS6TTTTT ATATTACA6C TCCBTSA6AG CGAAGC6AAC ACTTG 45 
s (2) INFORMATION FOR SEQ ID NO: 18: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7784" 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CATAT6ATCA AAT66TTC6G ATCT6ATTTT CCTCCTCTAA TATGC 45 
is (2) INFORMATION FOR SEQ ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
zo (D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN8197" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
6CATATTAGA G6AGGAAAAT CAGATCCGAA CCATTTGATC ATATGACAAG ATGTG 55 
25 (2) INFORMATION FOR SEQ ID NO: 20: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc « "LHN7791" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
6ACGGAATTC CCGCGGTAAA TAGCAATAAA TTGGC 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(1i) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN7780" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
10 6AC6G6TACC AC6C6TTAAT C 21 
(2) INFORMATION FOR SEQ ID NO: 22: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic add 

is (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(i1) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN5067" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
20 CCA6AACCT6 TCAATCCAC6 20 
(2) INFORMATION FOR SEQ ID NO: 23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEN6TH: 50 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "LWN8524" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
30 GACTGAATTC GGATCCACGC GTATAATAAA GAATAATTAT TAATCTGTAG 50 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN8528" 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
6ACTAAGCTT GAGCTCCACT AATATTAATA AACTATCGAA GG 42 
s (2) INFORMATION FOR SEQ ID NO: 25: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
io (D) T0P0L06Y: linear 

(11) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LHN8529" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
6ACT6AATTC CTGCAGGAGC TCAGTGAGAG CGAAGCGAAC AC 42 
is (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(1i) MOLECULE TYPE: other nucleic add 

(A) DESCRIPTION: /desc - "LWN853r 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

6ACTAAGCTT TGATCAAATG GTTGCGGCCG CGTCGACTCT AGACCCGGGT ACCAGATCTG 60 

25 6ATCCTC66G TTCTTCAAAT ATTTCTCC 88 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
(8) TYPE: nucleic acid 
so (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(1i) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "LWN8518" 
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(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
6ACTAAGCTT ACGCGTTCGG GTTCTTCAAA TATTTCTCC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(1) SEQUENCE CHARACTERISTICS: 
s (A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 
10 (A) DESCRIPTION: /desc - "LWN8527" 

(X1) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
6ACT6AATTC TGATCAAATG GTTCAGTGAG AGCGAAGCGA ACAC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(1) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 
20 (A) DESCRIPTION: /desc - "LWN7191" 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

6TTTTCCCAG TCACGAC 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
30 (A) DESCRIPTION: /desc = "LWN8516" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
GACTGAATTC GGATCCACGC GTGAGTAGTT CAACAAACGG GCC 
(2) INFORMATION FOR SEQ ID NO: 31: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
5 (D) T0P0L06Y: linear 

(11) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "LWN8517" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GACTAA6CTT GA6CTCCAAC ATGATTAACA ATTATTAGA6 6 
10 (2) INFORMATION FOR SEQ ID NO: 32: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LEN6TH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "LWN8760" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GACGGAATTC TCTAGAGTC6 ACAGATCC6A ACCATTTGAT CATATGACAA GATGTG 56 
20 (2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
2 5 (D) T0P0L06Y: linear 

(II) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "LWN8761» 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
GACGGAATTC GC6GCCGCGG TAAATAGCAA TAAATTGGCC 
so (2) INFORMATION FOR SEQ ID NO: 34: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "LWN4123 
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(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

AGCG6ATAAC AATTTCACAC AGGA 

(2) INFORMATION FOR SEQ ID NO: 35: 

(1) SEQUENCE CHARACTERISTICS: 
s (A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic acid 
10 (A) DESCRIPTION: /desc - "Tnasel" 



(X1) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

T6ACG6ATCC AC6C6T6GCG CACTCCC6TT CTGG 

(2) INFORMATION FOR SEQ ID NO: 36: 

(1) SEQUENCE CHARACTERISTICS: 
is (A) LENGTH: 36 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: other nucleic add 
20 (A) DESCRIPTION: /desc - "Tnase2" 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GTACG6ATCC ACGCGTAAAG GCACCTTTGG TCACGG 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc \3bis) 



\ A. The indications made below relate to the microorganism referred to in the description 



on page 



40 



.lines 25-27 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet ft | 



Name of depositary institution 

DEUTSCHE SAMMLUNG VON MIKROORGANISMEN UND ZELL- 
KULTUREN GmbH , , 



Address of depositary institution (including postal code and country) 



Mascheroder Weg lb, D- 3 8 12 4 Braunschweig, Federal Re- 
public of Germany 



| Date of deposit 

| 22 December 1995 


Accession Number 

DSM10445 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



In respect of those designations in which a European 
and/or Australian patent is sought, during the 
pendency of the patent application a sample of the 
deposited microorganism is only to be provided to an 
independent expert nominated by the person requesting 
the sample (Rule 28(4) EPC / Regulation 3.25 of 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



TTieindications listed belowwillbe submitted to the International Bu reaulater (specify the general nature of Vindications e.g., 'Accession 
Number of Deposit') 



For receiving Office use only 



|"^| This sheet was received with tbe international application 



Authorized officer 





For International Bureau use only 



[ | This sheet was received by tbe International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuIc 13Wj) 



A. The indications made below relate to the microorganism referred to in the description 



on page 



47 



.lines 10-12 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



Name of depositary institution 



DEUTSCHE SAMMLUNG VON MIKROORGANISMEN UND ZELL- 
KULTUREN GmbH 



Address of depositary institution (including postal code and country) 



Mascheroder Weg lb, D-38124 Braunschweig, Federal Re- 
public of Germany 



Date of deposit 



22 December 1995 



Accession Number 



DSM10444 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) T*is information is continued on an additional sheet Q 



In respect of those designations in which a European 
and/ or Australian patent is sought, during the 
pendency of the patent application a sample of the 
deposited microorganism is only to be provided to an 
independent expert nominated by the person requesting 
the sample (Rule 28(4) EPC / Regulation 3.25 of 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ^^^c fl Wa,e^/ora//ici^^5i^ 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later [specify the general nature of the indications eg., 'Accession 
Number of Deposit ) ' *' 



For receiving Office use only 



fx] s beet w as received with the international application 



Authorized officer 



Form PCI7RO/134(July 1992) 



For International Bureau use only 



□ 

This sheet was received by the Internationa! Bureau < 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuIc 13ta) 



A. The indications made below relate to the microorganism referred to in tbe description 
47 .lines 25-27 



on page 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet f"""| 



Name of depositary institution 

DEUTSCHE SAMMLUNG VON MIKROORGANISMEN UND ZELL- 
KULTUREN GmbH 



Address of depositary institution (including postal code and country) 

Mascheroder Weg lb, D-38124 Braunschweig, Federal Re- 
public of Germany 



Date of deposit 



22 December 1995 



Accession Number 



DSM10446 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



In respect of those designations in which a European 
and/or Australian patent is sought, during the 
pendency of the patent application a sample of the 
deposited microorganism is only to be provided to an 
independent expert nominated by the person requesting 
the sample (Rule 28(4) EPC / Regulation 3.25 of 
Australia Statut ory Pnl^g 1QQ1 No 71), 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not (or all designated Stoics) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications cg. t 'Accession 
Number of Deposit') 



For receiving Office use only 



["^j This sheet was received with tbe international application 



Authorized officer 




For International Bureau use only 



| [ This sheet was received by the International Bureau on: 



Authorized officer 



Form PCI7RO/134(July 1992) 
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CLAIMS 

1. A DNA construct comprising the structure IR(1) -P-R-M2-R- 
IR(2) or IR(l)-R-M2-R-P-iR(2) , wherein 

IR(1) and IR(2) denote transposase target sequences, 
5 P a DNA sequence of interest, 

R a target sequence for a site-specific recombination enzyme, 
and 

M2 a selectable marker gene, 

said structure being associated with a transposase gene T which 
10 is located on either side of and outside said structure. 

2. A DNA construct comprising the structure IR(1) -R-M2-T-R-P- 
IR(2), IR(1)-P-R-M2-T-R-IR(2), IR (1) -R-T-M2-R-P-IR (2 ) or IR(1)- 
P-R-T-M2-R-IR(2) , wherein 

IR(1) and IR(2) denote transposase target sequences, 
15 P a DNA sequence of interest, 

R a target sequence for a site-specific recombination enzyme, 
M2 a selectable marker gene, and 
T a transposase gene T. 

3. The DNA construct according to claim 1 or 2, wherein R is a 
20 res-site from plasmid pAM/91 or a lox site from phage Pi. 

4. A DNA construct comprising the structure IR(1) -P-R , -M2-R l f - 
IR(2) or IRUJ-R'-lte-R 1 «-P-IR(2) , wherein 

IR(1) and IR(2) denote transposase target sequences, 
P a DNA sequence of interest, 
25 R 1 and R' ■ parallel repeat sequences, and 
M2 a selectable marker gene, 

said structure being associated with a transposase gene T which 
is located on either side of and outside said structure. 

5. A DNA construct comprising the structure IR(1)-R'-M2-T-R I 
30P-IR(2), IR(1)-P-R'-M2-T-R' »-IR(2) , IR(1) -R» -T-M2-R' »-P-IR(2) 

or IR(1)-P-R , -T-M2-R' '-IR(2) , wherein 
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IR(1) and IR(2) denote transposase target sequences, 
P a DNA sequence of interest, 
R 1 and R 1 ■ parallel repeat sequences, 
M2 a selectable marker gene, and 
5 T a transposase gene T. 

6. The DNA construct according to any of claims 1-5, which 
further comprises a selectable marker gene, Ml, which may be 
different from or identical with M2, and which is located on 
either side of and outside the structure located within and 

io comprising IR(1) and IR(2) . 

7. The DNA construct according to any of the preceeding claims, 
which is capable of being transposed in a gram-positive 
bacterium, in particular a cell of Bacillus . 

8. The DNA construct according to any of claims 1-7, which 
15 further comprises a cis-acting DNA sequence required for the 

transfer of the DNA construct into a recipient cell by con- 
jugation in the presence of a trans-acting mobilizing element. 

9. The DNA construct according to any of claims 1-8, in which 
the DNA sequence P encodes a polypeptide of interest. 

20 10. The DNA construct according to claim 9, in which the DNA 
sequence P encodes a translocated polypeptide. 

11. The DNA construct according to claim 10, wherein the DNA 
sequence P encodes a secreted polypeptide. 

12. The DNA construct according to claim 9, wherein the 
25 polypeptide is an enzyme. 

13. The DNA construct according to claim 12, wherein the enzyme 
is selected from the group consisting of an amylolytic enzyme, 
a lipolytic enzyme, a proteolytic enzyme, a cellulytic enzyme, 
an oxidoreductase or a plant cell-wall degrading enzyme. 
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14. The DNA construct according to claim 10 , wherein the 
polypeptide is PrsA. 

15. A vector comprising the DNA construct according to any of 
5 claims 1-14. 

16. The vector according to claim 15, which comprises a condi- 
tional origin of replication. 

17. The vector according to claim 16, in which the conditional 
origin of replication is a temperature sensitive origin of 

10 replication. 

18. A bacterial cell, which in its constitutive DNA has 
integrated at least two copies of a DNA construct comprising 
the following structure IR(1) -P-IR(2) , in which 

IR(1) and IR(2) denote transposase target sequences, and 
15 P a DNA sequence of interest, the structure further comprising 
a DNA sequence R being a target sequence for a site-specific 
recombination enzyme which is present as a result of excision 
of a DNA fragment between two R sequences by the action of a 
resolvase enzyme or the DNA sequence resulting from the homolo- 
20 gous recombination between the DNA sequences R'/R 1 

19. The cell according to claim 18 which is free from any 
undesired selectable marker gene. 

20. The cell according to claim 19, which is free from a DNA 
sequence encoding a selectable antibiotic-resistance marker. 

„ 25 21 . The cell according to any of claims 18-20, which is a cell 
of a gram-positive bacterium. 

22. The cell according to claim 21, which is a cell of a 
Bacillus sp. or a Lactobacillus sp. 
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23* The cell according to claim 22, which is a cell of Bacillus 
sp. selected from the group consisting of Bacillus subtilis . 
Bacillus licheniformis , Bfl<?iUv*s lentVis, ^acillus brevjs, 
Bacillus stearothennoohilus . Bacillus alkaloohilus , Bacillus 
5 amvlolicruefaciens . Bacillus coaaulans. Bacillus circulans, 
Bacillus lautus . Bacillus meoaterium . Bacillus thurincriensis. 

24. The cell according to any of claims 18-23, wherein the DNA 
sequence P encodes a polypeptide of interest. 

25. The cell according to claim 24, wherein the DNA sequence P 
io encodes a translocated polypeptide. 

26. The cell according to claim 20, wherein the DNA sequence P 
encodes a secreted polypeptide. 

27. The cell according to claim 24, wherein the polypeptide is 
an enzyme. 

15 28. The cell according to claim 27, wherein the enzyme is 
selected from the group consisting of an amylolytic enzyme, a 
lipolytic enzyme, a proteolytic enzyme, a cellulytic enzyme, an 
oxidoreductase or a plant cell-wall degrading enzyme. 

29. A method of constructing a bacterial cell, which in its 
20 genomic DNA has integrated more than one copy of a DNA sequence 
of interest, and which is free from a DNA sequence encoding an 
undesired selectable marker, which method comprises 

a) introducing a first vector comprising a DNA construct 
according to any of claims 1, 3 and 6-14 comprising the struc- 
25 ture IR(1)-P-R-M2-R-IR(2) or IR(1) -R-M2-R-P-IR(2) in associa- 
tion with with a tranposase gene T and optionally a selectable 
marker gene Ml into the host cell, 
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b) selecting for cells being Ml*, M2% which in their genome 
comprises the structure IR(i) -P-R-M2-R-IR(2) or IR(1) -R-M2-R-P- 
IR(2), 

c) introducing a second vector comprising a DNA sequence 
5 encoding a site specific recombinase into the cells selected in 

step b) so as to excise the structure R-M2 or M2-R from the 
genome of the cell whereby cells are obtained having integrated 
the structure IR(1) -R-P-IR(2) or IR(1) -p-r-ir (2 ) , 

d) curing the cells resulting from step c) for the second 
io plasmid, and optionally 

e) repeating steps a-d one or more times to produce bacterial 
cells comprising one or more additional copies of the structure 
IR(1)-R-P-IR(2) or IR(1)-P-R-IR(2) . 

30. A method of constructing a bacterial cell, which in its 
15 genomic DNA has integrated more than one copy of a DNA sequence 
of interest, and which is free from a DNA sequence encoding an 
undesired selectable marker, which method comprises 

a) introducing a first vector comprising a DNA construct 
according to any of claims 2, 3 and 6-14 comprising the struc- 

20 ture IR(1)-R-M2-T-R-P-IR(2) , IR(1) -P-R-M2-T-R-IR(2) , IR(1)-R-T- 
M2-R-P-IR(2) or IR(l) -P-R-T-M2-R-IR(2) optionally in associa- 
tion with a selectable marker gene Ml into the host cell, 

b) selecting for cells being Ml", M2\ which in their genome 
comprises one of the structures identified in a) , 

25 c) selecting for cells having an increased number of copies of 
the marker gene M2, 

d) introducing a second vector comprising a DNA sequence 
encoding a site specific recombinase into the cells selected in 
step b) so as to excise the structure R-M2-T, R-T-M2, M2-T-R or 
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T-R-M2 from the genome of the cell whereby cells are obtained 
having integrated the structure IR(1) -R-P-IR(2) or IR(1)-P-R- 
IR(2), 

e) curing the cells resulting from step d) for the second 
5 plasmid, and optionally 

f) repeating steps a-e one or more times to produce bacterial 
cells comprising one or more additional copies of the structure 
IR(1)-R~P-IR(2) or IR(1)-P-R-IR(2) . 

31. The method according to claim 29 or 30, wherein the second 
10 plasmid is curable. 

32. A method for constructing a bacterial cell, which in its 
genomic DNA has integrated more than one copy of a DNA sequence 
of interest, and which is free from a DNA sequence encoding an 
undesired selectable marker, which method comprises 

15 a) introducing a first vector comprising a DNA construct 
according to any of claims 4 and 6-14 comprising the structure 
IR(1)-P-R f -M2-R"-IR(2) or IR(1) -R , -M2-R ,i -P-IR(2) in association 
with a tranposase gene T and optionally a selectable marker 
gene Ml into the host cell, in which R 1 and r" denote parallel 

20 repeat sequences, 

b) selecting for cells being Ml", M2*, which in their genome 
comprises the structure IR(1) -P«R , -M2-R ,, -IR(2) or IRUJ-R 1 -^- 
R"-P-IR(2) , 

c) allowing homologous recombination between the DNA sequences 
25 R' and R" to take place so as to excise the selectable marker 

gene M2 , whereby cells are obtained having integrated the 
structure IR(1)-R7R' '-P-IR(2) or IR(1)-P-RVR' '-IR(2) (wherein 
R'/R 11 denotes the common recombination sequence), and op- 
tionally 
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d) repeating steps a-c one or more times to produce bacterial 
cells comprising one or more additional copies of the DNA 
structure IR(1)-RVR' f -P-IR(2) or IR(1) -P-R ! /R' 1 -IR(2) . 

33. A method for constructing a bacterial cell, which in its 
5 genomic DNA has integrated more than one copy of a DNA sequence 

of interest, and which is free from a DNA sequence encoding an 
undesired selectable marker, which method comprises 

a) introducing a first vector comprising a DNA construct 
according to any of claims 5-14 comprising the structure IR(l)- 

10 R'-Jte-T-R' '-P-IR(2) , IR(1)-P-R'-M2-T-R" '-IR(2) , IR(1) -R' -T-M2- 
R"-P-IR(2) or IR(1)-P-R'-T-M2-R' »-IR(2) optionally in associa- 
tion with a selectable marker gene Ml into the host cell, in 
which R* and R" denote parallel repeat sequences, 

b) selecting for cells being Ml*, M2\ which in their genome 
15 comprises the relevant structure identified in a) , 

c) selecting for cells having an increased number of copies of 
the selectable marker gene M2, 

d) allowing homologous recombination between the DNA sequences 
R' and r" to take place so as to excise the selectable marker 

20 gene M2 and the transposase gene T, whereby cells are obtained 
having integrated the structure IRflJ-R'/R 1 '-P-IR(2) or IR(l)- 
P-R'/R 1 ' -IR(2) (wherein R'/R 1 1 denotes the common recombination 
sequence) , and optionally 

e) repeating steps a-d one or more times to produce bacterial 
25 cells comprising one or more additional copies of the DNA 

structure IR(1) -R'/R' 1 -P-IR<2) or IR(1) -P-R'/R' '-IR(2) . 

34. The method according to claim 32 or 33, wherein the 
selectable marker gene M2 is a counterselectable marker, such 
as a amdS gene or a thy gene, and wherein, subsequent to step 
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C ) selection is made for the absence of the counterselectable 
marker. 

35. The method according to any of claims 29-34, wherein step 
a) is accomplished by conjugation, the first vector additional- 

5 ly comprising a cis-acting DNA seguence reguired for the 
transfer of the vector into the bacterial cell by conjugation 
in the presence of a trans-acting mobilizing element, the first 
vector being present in a population of donor cells further 
comprising at least one DNA seguence encoding said trans-acting 

10 mobilizing element, the popolation of donor cells being mixed 
with a population of recipient bacterial cells under conditions 
allowing the vector to be transferred from the population of 
the donor cells to the population of recipient cells by con- 
jugation. 

15 36. The method according to any of claims 29-35, wherein the 
DNA construct introduced according to step a) further comprises 
an origin of replication. 

37. The method according to claim 36, wherein the origin of 
replication is conditional. 

20 38. The method according to claim any of claims 29-37, in which 
steps a)-e) is repeated one or more times. 

39. The method according to any of claims 29-38, wherein the 
DNA sequence R and the DNA encoding the site-specific recombi- 
nase are derived from plasmid pAM/31. 

25 40. The method according to any of claims 29-39, wherein the 
integrated DNA construct IR(1) -P-IR(2) , optionally further 
comprising R or R'/R' 1 * is amplified so as to obtain cells 
comprising two or more copies of said DNA construct. 

41. The method according to any of claims 29-4 0, wherein the 
30 cell to be constructed is a cell of a gram-positive bacterium. 
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42. The method according to claim 41, wherein the cell is a 
cell of a Bacillus sp. or a Lactobacillus sp. 

43. The method according to claim 42, wherein the cell is a 
cell of 3acj.llus sp. selected from the group consisting of 

5 BftcjUus subtjljs, Bacillus lichenif ormis . Bacillus lentus . 
Bafiilllis frrevjs, Bacillus stearothermophilus. Bacillus alkalo- 

PhilU?, Bacillus amvlol ia uef aciens , Bacillus coaaulans . 

BagU^us cjrculans, Bacillus lautug r Bacillus mecraterium . Ba- 
cillus thurinaiensis. 

io 44. A method of producing a protein of interest encoding by a 
DNA sequence P, which method comprises cultivating a cell 
according to any of claims 18-28 and/or produced by the method 
according to any of claims 29-43, which cell comprises more 
than one copy of the DNA sequence P, under conditions suitable 

is for producing the polypeptide, and recovering the polypeptide 
from the resulting cell broth. 

45. A marker-free cell of a gram-positive bacterium, in 
particular of a Bacillus sp., comprising muliple copies of a 
DNA sequence P of interest. 
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